METHODS FOR INCREASING PROTEIN EXPRESSION

Field of the Invention 

This invention relates to expression of heterologous polypeptides. More 
specifically, the invention discloses novel recombinant host strains comprising high 
copy number cloning plasmids comprising a gene encoding the heterologous 
polypeptide. Expression of the heterologous polypeptide is effectively controlled 
by locating the regulator gene, which controls expression of the heterologous 
polypeptide, on the host chromosome and under control of a strong promoter. 

Background of the Invention 

Methods for recombinantly producing heterologous polypeptides are well
known (see, e.g., Watson et al., Recombinant DNA, 2nd ed., (1992)). The most
commonly used regulated heterologous gene expression systems in, for example,
Escherichia coli host cells are those based on LacI-mediated repression of a promoter
which controls expression of the heterologous polypeptide. Release from repression
occurs through addition of an inducer (such as isopropylthiogalactoside (IPTG)),
which binds to the repressor, or through an excess of DNA binding sites titrating the
repressor.

Most commonly used host cells cannot control the expression of the genes 
cloned onto high copy number plasmids because the number of DNA binding sites 
on the plasmids exceeds the level of available repressor. If cloned products are 
deleterious or disadvantageous in a cell population, cells either die or mutations 
inactivating the heterologous gene accumulate. 

The Ptac promoter in E. coli is 3-fold stronger than the Plac promoter when
fully derepressed. Therefore, it is frequently used for promoting high level
regulated gene expression in E. coli. However, while the Plac promoter is 1,000-fold
repressed by LacI, the Ptac promoter is only 50-fold repressed (Lanzer, M. & H.
Bujard. 1988. Proc. Natl. Acad. Sci. USA. 85:8973). Repression of the E. coli Ptac
promoter (De Boer, et al., 1983. Proc. Natl. Acad. Sci. USA. 78:21-25) or other lac-
related promoters depends upon the concentration of the repressor, LacI. As set
forth above, release from repression can occur through addition of inducer, which
reduces the affinity of the repressor for its specific DNA binding site, or through a
reduction in the concentration of the repressor relative to the molar concentration of
specific DNA binding sites on the plasmid. If the lacI gene is located on a high copy
number cloning plasmid, a large amount of inducer is required to initiate expression
because of the large amount of repressor produced in such a system.
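
As a rough illustration of why the weaker repression of Ptac matters, the fold values quoted above can be combined into a relative uninduced (basal) expression level. The sketch below assumes that basal expression is simply the derepressed strength divided by the fold repression, and it uses arbitrary units in which fully derepressed Plac equals 1. Both choices are simplifying assumptions made for illustration, not part of the cited measurements.

```python
# Relative promoter strength when fully derepressed (arbitrary units, Plac = 1).
PLAC_INDUCED = 1.0
PTAC_INDUCED = 3.0 * PLAC_INDUCED  # Ptac is about 3-fold stronger than Plac

# Fold repression by LacI, as quoted in the text above.
PLAC_FOLD_REPRESSION = 1000.0
PTAC_FOLD_REPRESSION = 50.0


def basal_expression(induced_level: float, fold_repression: float) -> float:
    """Uninduced expression, assuming basal = induced / fold repression."""
    return induced_level / fold_repression


plac_basal = basal_expression(PLAC_INDUCED, PLAC_FOLD_REPRESSION)
ptac_basal = basal_expression(PTAC_INDUCED, PTAC_FOLD_REPRESSION)
leak_ratio = ptac_basal / plac_basal

print(f"Plac basal level: {plac_basal:.3f}")
print(f"Ptac basal level: {ptac_basal:.3f}")
print(f"Ptac leaks roughly {leak_ratio:.0f}-fold more than Plac")
```

Under these assumptions the repressed Ptac promoter is roughly 60-fold leakier than repressed Plac, which is consistent with the need for more repressor when Ptac is carried on a multi-copy plasmid.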
The control of Ptac on a multi-copy plasmid is often mediated by LacI
expressed from a lacI promoter mutation on the chromosome, designated
lacIQ (sometimes referred to as lacIq or variations thereof herein), in a system
without a corresponding lacI gene on the expression plasmid. The lacIQ mutation is
a single CG to TA change at -35 of the promoter region of lacI (Calos, M. 1978.
Nature 274:762) which causes a 10-fold increase in LacI expression (Müller-Hill, B.
et al. 1968. Proc. Natl. Acad. Sci. USA. 59:1259). Wild-type cells have a concentration
of LacI of 10^-8 M or about 10 molecules per cell, with 99% present as a tetramer
(Fickert, R. & B. Müller-Hill. 1992. J. Mol. Biol. 226:59). Cells containing the lacIQ
mutation contain about 100 molecules per cell or 10^-7 M LacI.
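
The correspondence between molecules per cell and molar concentration can be checked with a short calculation. The sketch below assumes an E. coli cell volume of about 1 femtoliter; that volume is an assumption introduced for illustration and is not stated in the text.

```python
AVOGADRO = 6.022e23        # molecules per mole
CELL_VOLUME_L = 1.0e-15    # assumed E. coli cell volume (about 1 femtoliter)


def molecules_to_molar(molecules_per_cell: float,
                       cell_volume_l: float = CELL_VOLUME_L) -> float:
    """Convert a per-cell molecule count to a molar concentration."""
    return molecules_per_cell / (AVOGADRO * cell_volume_l)


wild_type_molar = molecules_to_molar(10)    # wild-type lacI
laciq_molar = molecules_to_molar(100)       # lacIQ, 10-fold more repressor

print(f"wild type: {wild_type_molar:.1e} M")
print(f"lacIQ:     {laciq_molar:.1e} M")
```

With this assumed volume, 10 molecules per cell falls in the 10^-8 M range and 100 molecules per cell in the 10^-7 M range, in line with the order-of-magnitude figures given above.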

The plasmid pBR322 is present in 39-55 copies per cell, depending on the
growth rate of the cells (Lin-Chao, S. & H. Bremer. 1986. Mol. Gen. Genet. 203:143).
Therefore, pBR322-based plasmids should be controlled by chromosomal lacIQ.
Plasmids based on pUC, having a higher copy number (500-2,000 copies per cell),
should not be controlled by lacIQ alone, and expression should be leaky. Other
systems have been developed, such as the T7 system of Studier and Moffatt (1986. J.
Mol. Biol. 189:113-130 and Dubendorff and Studier. 1991. J. Mol. Biol. 219:45-59), or
involving lacIQ overexpression on a plasmid (Law et al., 1993. J. Mol. Biol. 230:161-
173), to manage the expression of genes which require tight control of expression
before induction.

However, in cases where the repressor gene is on a high copy number 
plasmid comprising the heterologous gene, the repressor is over-expressed relative 
to the amount needed to control expression of the heterologous gene controlled 
thereby (i.e., the stoichiometry is incorrect). As set forth above, this system requires 
a large amount of inducer (e.g., IPTG) to be added to the host cell during 
fermentation, which is costly and not efficient. On the other hand, when the 
repressor gene is located on the chromosome and not on the high copy number 
plasmid, the amount of repressor produced may be too little to control expression of 
the heterologous gene on the plasmid, resulting in leaky control of expression. For 
example, although lacIQ results in a ten-fold higher level of expression of Lac
repressor, this is not sufficient for tightly regulated expression from Ptac on a high
copy number plasmid if lacIQ is located on the chromosome.

In response to these and other problems, the present invention provides 
recombinant host strains for expressing heterologous polypeptides, comprising high 
copy number cloning plasmids comprising a gene encoding a heterologous 
polypeptide. Expression of the heterologous polypeptide is effectively controlled by 
locating the regulator gene, which controls expression of the heterologous
polypeptide, on the host chromosome and under control of a strong promoter. A
specific embodiment relates to an allele of lacI capable of repressing expression of
genes from lac-regulated promoters on high copy number plasmids. lacIQ1
(sometimes referred to as lacIq1 or other variations herein) is a mutation of the
repressor's promoter (Calos and Miller. 1981. Mol. Gen. Genet. 183:559-560) which
results in ~170-fold more LacI than the wild type promoter, compared to a 10-
fold increase reported for lacIQ. The increased expression of Lac repressor allows for
repression of gene expression on the high copy number plasmid.
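
The stoichiometric argument can be made concrete by comparing repressor counts with the number of plasmid-borne binding sites. The sketch below is deliberately crude: it takes the per-cell repressor numbers and copy-number ranges quoted in this Background, assumes one binding site per plasmid, and ignores tetramer formation, binding equilibria and the chromosomal operator. The function and dictionary names are illustrative only.

```python
WILD_TYPE_REPRESSORS = 10  # LacI molecules per cell for the wild-type promoter

# Repressor molecules per cell, scaled by the fold increases quoted in the text.
REPRESSORS_PER_CELL = {
    "lacI (wild type)": WILD_TYPE_REPRESSORS,
    "lacIQ": 10 * WILD_TYPE_REPRESSORS,
    "lacIQ1": 170 * WILD_TYPE_REPRESSORS,
}

# Plasmid copy-number ranges (copies per cell) quoted in the text.
PLASMID_COPIES = {
    "pBR322": (39, 55),
    "pUC": (500, 2000),
}


def repressor_to_site_ratio(repressors: int, copies: int,
                            sites_per_plasmid: int = 1) -> float:
    """Ratio of repressor molecules to plasmid-borne binding sites."""
    return repressors / (copies * sites_per_plasmid)


for allele, repressors in REPRESSORS_PER_CELL.items():
    for plasmid, (low, high) in PLASMID_COPIES.items():
        worst = repressor_to_site_ratio(repressors, high)
        best = repressor_to_site_ratio(repressors, low)
        print(f"{allele:17s} {plasmid:7s} ratio {worst:.2f} to {best:.2f}")
```

A ratio well below 1 suggests that binding sites outnumber repressor and expression is likely to be leaky; ratios near or above 1 suggest that repression is at least plausible under these simplifications.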

The specific embodiment provides the following representative advantages 
over the prior art: 

° Ideal for all standard blue/white cloning and selection systems;
° Low level of Plac and related promoter expression in the absence of
inducer;
° Safer for cloning deleterious gene products; reduces selective
disadvantage of the cloned gene;
° Induction is less damaging to cells than the T7 expression system of
Studier and Moffatt (Dubendorff and Studier. 1991. J. Mol. Biol. 219:45-59)
(Studier and Moffatt. 1986. J. Mol. Biol. 189:113-130);
° Low inducer (IPTG) levels are required to induce gene expression;
° Expression level is titratable by IPTG concentrations for ideal expression
levels; and
° lacIQ1 is easily and rapidly detected by a PCR test.

Summary of the Invention 

The present invention relates to methods and host cells for expressing 
heterologous polypeptides. 

More specifically, the invention relates to recombinant host cells comprising 
high copy number cloning plasmids comprising a gene encoding a heterologous 
polypeptide. 

More specifically, the invention relates to a prokaryotic host cell for 
producing a heterologous polypeptide comprising a high copy number plasmid 
comprising a regulatable expression unit encoding the polypeptide and a 
chromosomally-located gene encoding a regulatory protein capable of regulating the 
regulatable expression unit, expression of the regulatory protein being controlled by 
a strong promoter. 

More specifically, the invention relates to a prokaryotic host cell for
producing a heterologous polypeptide comprising a high copy number plasmid
comprising a repressible expression unit encoding said polypeptide and a
chromosomally-located gene encoding a repressor capable of repressing said
repressible expression unit, expression of said repressor being controlled by a strong
promoter.

More specifically, the host cell is a bacterium, preferably E. coli, expression of
the heterologous polypeptide is under control of a LacI-repressed promoter, and
expression of the LacI repressor is under control of the lacIQ1 promoter.

The invention also relates to methods for expressing heterologous proteins 
comprising culturing the above-described host cells and inducing expression of said 
genes. 

The present invention also relates to methods for controlling plasmid 
expression. More particularly, the methods involve the use of a repressor together 
with an inducer to control the high-copy-number-plasmid expression of proteins, 
preferably recombinant proteins. 

The methods can be used in any organism for which regulated promoter 
systems exist for heterologous gene expression, including, for example, species of 
the genera Escherichia, Salmonella, Bacillus, Clostridium, Streptomyces, Staphylococcus,
Neisseria, Lactobacillus, Shigella, and Mycoplasma.

Brief Description of the Figures

Figure 1 is the plasmid map for pSGE705.

Figure 2 shows a simplified schematic for the construction of expression
vector, pSGE705. Figure 2A shows the ligation of pBR322 origin and Tet resistance
gene (PCR fragment). Figure 2B shows the insert lacI from pRG1 (PCR fragment).
Figure 2C shows the insert di-alpha and beta with new promoter region and shorter
intergenic spacer. Figure 2D shows the results of site-directed mutagenesis to
optimize ribosome binding sites and several other modifications.

Figure 3 shows a simplified schematic for the construction of expression
vector, pSGE715. Figure 3A shows the ligation of pUC19 origin and Tet resistance
gene (PCR fragment). Figure 3B shows the insert lacI from pRG1 (PCR fragment).
Figure 3C shows the insert Ptac, di-alpha and beta from pSGE705 into BamHI and
HindIII sites of pSGE508.
Detailed Description of the Invention

As used herein, the term "regulatable expression unit" means a nucleic acid 
molecule comprising a unit of gene expression and regulation, including 
heterologous genes, regulator genes and control elements which are necessary for 
transcription, translation and for recognition by regulator gene products, including
repressors. All of the control elements may be contiguous to the heterologous gene
or not.



The term "heterologous," when referring to a gene, indicates that the gene has 
been inserted into a host cell either by way of a stable plasmid or through 
integration into the genome, and, when referring to a protein or polypeptide, 
indicates that the protein or polypeptide is the product of a heterologous gene. 
Heterologous proteins are proteins that are not normally produced by a particular
host cell.

"Host cell" refers to a prokaryotic organism containing, or hosting, a plasmid
artificially constructed using techniques well known in molecular biology.
Examples include Escherichia, Salmonella, Bacillus, Clostridium, Streptomyces,
[...] JM83, JM101, JM103, JM105, JM107, JM109,
[...] MC4100, MM294, NM522, NM554, TG1, [...], XL1-Blue, and
Y1089, all of which are commercially available, for example, from New England
Biolabs (Internet address: http://www.neb.com).

"High copy number plasmid" refers to a plasmid for use in cloning
heterologous genes in a host cell that is present in greater than about 200 copies per
cell. Examples include pALTER®-1 Vector, pALTER®-Ex1 Vector, pALTER® [...],
pGEM®-3Z Vector, pGEM®-[...]f(+/-) Vectors, pGEM®-7Z[...] Vectors,
pGEM®-9Z[...](-) Vector, pGEM®-11Zf(+/-) Vectors, and the other pGEM® vectors
[...] pUC-based [...] vectors, [...] pUC 8, 9, 18 and
19, pBacPAK8/9, and pAcUW31, and pET, all available from Clontech.

"Medium copy number plasmid" refers to a plasmid for use in cloning
heterologous genes in a host cell that is present in about 25-200 copies per cell.
Examples include pBR322, pMAM, pMAMneo, pEUK-C1, pPUR, [...],
pKK233-2, and pKK388-1 (all available from Clontech).
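
The two copy-number definitions above amount to a simple threshold rule, sketched below. The thresholds are described as approximate ("about") in the text, and the label returned for plasmids below 25 copies per cell is a placeholder chosen here because this section does not define that range. The function name is hypothetical.

```python
def classify_plasmid(copies_per_cell: float) -> str:
    """Classify a plasmid by copy number using the approximate cut-offs above."""
    if copies_per_cell > 200:
        return "high copy number"
    if copies_per_cell >= 25:
        return "medium copy number"
    # This range is not defined in the text; the label is a placeholder.
    return "below the medium range"


for name, copies in [("pBR322 (low end)", 39), ("pBR322 (high end)", 55),
                     ("pUC (low end)", 500), ("pUC (high end)", 2000)]:
    print(f"{name:18s} {copies:5d} copies/cell -> {classify_plasmid(copies)}")
```

Applied to the copy numbers quoted in the Background, pBR322 falls in the medium range and pUC-based plasmids fall in the high range.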

"Strong promoter" refers to a promoter or promoter mutation that increases
promoter activity, i.e. transcription initiation, at least 10-fold greater than the native
promoter or equivalent, or stronger than the E. coli lac promoter, fully induced.
When referring to the promoter which controls expression of a chromosomally-
located repressor, the strong promoter is called a "strong repressor promoter."
Examples include the lac, [...], gal and mal promoters.

"Regulator" or "regulatory protein" refers to a transcriptional regulatory
protein which exerts direct control over gene expression by binding and/or
releasing DNA at a specific promoter. In particular in the context of the invention,
the regulator is involved in controlling expression of the heterologous gene.

"Regulatable gene or expression unit" refers to genes or expression units
affected by a regulator.

"Repressor" refers to a DNA regulatory protein which exerts direct negative 
control over gene expression at a specific promoter. In particular in the context of 
the invention, the repressor controls expression of the heterologous gene. 

"Repressible" refers to a regulatory system in which the product of a
regulator gene (the repressor) blocks transcription of a particular gene, or expression
unit.

"Inducer" refers to an effector or molecule which inactivates the repressor
and thereby causes expression of the repressible gene.

Recombinant prokaryotic systems, particularly bacterial systems, for
producing heterologous proteins or polypeptides are well known in the art. The
genes encoding the target protein can be placed in a suitable expression vector or
plasmid and inserted into a microorganism or host cell. These host cells may be
produced using standard recombinant DNA techniques and may be grown in cell
culture or in fermentations. For example, human alpha and beta globin genes of
human hemoglobin have been cloned and sequenced by Liebhaber et al. (Proc. Natl.
Acad. Sci. USA 77:7054-7058, 1980) and Marotta et al. (J. Biol. Chem. 252:5040-5053,
1977) respectively. Techniques for expression of both native and mutant alpha and
beta globins and their assembly into hemoglobin are set forth in U.S. Patent
5,028,588 to S.J. Hoffman, incorporated herein by reference; K. Nagai and Hoffman,
S.J. et al., PCT/US90/02654; Townes, T.M. and McCune, S.L., PCT/US91/09624; and
De Angelo, J. et al., PCT/US91/02568 and PCT/US91/08108, also incorporated
herein by reference. The host cells of the present invention can be cultured at a
temperature of about 20°C to about 30°C, more specifically about 24°C to about
28°C and preferably about 26°C.

The present invention relates to regulation of the expression of such target
heterologous proteins. One method for increasing the amount of heterologous
protein expressed by a host cell involves using a high copy number plasmid into
which is cloned the gene encoding the heterologous protein. Unfortunately,
however, if expression of the heterologous protein is not tightly controlled, then
early expression of the protein can have deleterious effects on the host cells.

To overcome these problems, a system has been invented to tightly control
expression of the heterologous protein from the high copy number plasmid. The
system employs a chromosomally located gene encoding a regulator of expression of
the heterologous protein which is under control of a strong promoter.

The system is useful in a wide variety of prokaryotic host cells as set forth
above.

High copy number cloning plasmids useful in these hosts also are set forth
above.

Strong promoters and associated repressors useful together with the plasmids
are set forth, for example, in Weickert & Adhya (Weickert & Adhya. 1992. J. Biol.
Chem. 267:15869-74), incorporated herein by reference. In general, any regulator,
[...] nucleosides, hypoxanthine, guanine, modified amino acids [...]
proteins which are used [...] a DNA [...] binding, in a reversible [...]
synthesized from a chromosomal gene(s) to match or exceed the number of DNA
[...] heterologous gene(s), as part of the transcriptional control mechanism of the
heterologous gene(s). The result is an inducible expression system with many
[...]
Promoters of regulatory proteins in prokaryotes are typically too weak to
[...] medium to [...] number plasmids. For
example, [...] about 10 copies of the Lac Repressor [...] each E. coli cell, far
less than sufficient to repress a plasmid of 50-2,000 copies per cell containing one
DNA binding site for Lac Repressor per plasmid. Thus there would be no
significant regulation of Lac Repressor-regulatable genes on these plasmids in these
[...]. The level of production of Lac Repressor and other regulatory proteins is
low because the promoters directing transcription of the genes [...]
the regulatory proteins are weak (Table A). The DNA sequences of typical
promoters for expression of regulatory proteins are poor matches to the sequences
efficiently recognized by RNA polymerase (Table A). Mutations which modify the
promoter can produce a DNA sequence which is more efficient at elevating the
transcription of the gene encoding a regulatory protein. Examples of these
mutations include lacIQ and lacIQ1 (Table A). These mutations improve the match
of the -35 region of the promoter with the consensus sequence (Table A). The promoter
regions of the other promoters shown, and most other regulatory protein promoters,
can be similarly improved by mutations which improve the match between the 
promoter regions and the consensus sequences at the -35 region and also the -10 
region. Other mutations can also affect the promoter activity, by, for example, 
changing the spacing between the -10 and -35 sequences, and changing the 
nucleotide sequence around the -10 region and the transcription start site. Although 
a particularly useful way to improve transcription and therefore production of a 
regulatory protein may be to engineer, by well known recombinant DNA techniques 
such as site directed mutagenesis, a consensus -35 region, other mutations in the 
entire promoter region also may provide promoter improvements. These promoter 
improvements are referred to as "strong promoters", "strong regulatory promoters",
or "strong repressor promoters", depending upon their functions.
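
The idea of improving a promoter's match to the consensus can be illustrated by counting identical positions in the -35 hexamer. In the sketch below, the consensus TTGACA comes from Table A, while the three lacI hexamers are assumptions drawn from the descriptions in this document (a single C-to-T change at -35 for lacIQ, and a consensus -35 region for lacIQ1); they should be checked against the original Table A before being relied upon. The function name is illustrative.

```python
CONSENSUS_35 = "TTGACA"  # -35 consensus from Table A

# Assumed -35 hexamers for the lacI promoter alleles (verify against Table A).
MINUS_35_HEXAMERS = {
    "PlacI": "GCGCAA",
    "PlacIQ": "GTGCAA",   # single C-to-T change relative to PlacI
    "PlacIQ1": "TTGACA",  # matches the consensus
}


def consensus_matches(hexamer: str, consensus: str = CONSENSUS_35) -> int:
    """Count positions at which a hexamer is identical to the consensus."""
    if len(hexamer) != len(consensus):
        raise ValueError("hexamer and consensus must have the same length")
    return sum(a == b for a, b in zip(hexamer.upper(), consensus))


scores = {name: consensus_matches(seq) for name, seq in MINUS_35_HEXAMERS.items()}
for name, score in scores.items():
    print(f"{name:8s} {MINUS_35_HEXAMERS[name]}  {score}/6 matches")
```

A simple identity count ignores the fact that some positions contribute more to promoter strength than others, so it should be read only as a qualitative ranking.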

Promoter regions in other prokaryotic organisms may similarly be improved 
to make strong promoters. In Table A, examples from Bacillus subtilis and from 
Klebsiella pneumoniae are included. The most common (vegetative) promoters for
these organisms are similar to those of E. coli, although the strength of the B.
subtilis promoter is also dependent upon the DNA sequence adjacent to the -10
region, especially the proximal 5 basepairs (the extended -10 region). The strength
of the promoter for the gene encoding the regulatory protein of interest can be
improved by i) improving the DNA sequence homology between the promoter
consensus sequence and the regulatory protein gene's promoter sequence, and/or ii)
improving the promoter strength by mutation of sequence outside the consensus 
sequences. Note that the consensus sequences of promoters in organisms other than 
E. coli may not necessarily match the consensus -10 and -35 sequences shown in 
Table A. This is especially true for organisms with a substantially different 
nucleotide bias, for example, thermophilic organisms which have a higher G+C 
content than E. coli. 
Table A.

Comparison of promoters for regulatory proteins to the consensus sequences of
promoters recognized by the most abundant (vegetative) form of RNA
polymerase.

            -35     -10
Consensus:  TTGACA  TATAAT

PfruR:    AAAAGCGGrjSC^GAGTGATCAAACTGCGCrj^^GTrAACGATITrAA
PmalI:    TAAATCACATJACGCAACGATAATAGCGGGTAT^AGATAAATAAAAGGTAA
PrafR:    CTTTGGTGA^G^A^TTTTCTGGATTTCCGGCTIAG^AACCACAGCAGGAGATA
PlacI:    CATCGAATGGCGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAG
PlacIQ:   CATCGAATGGTGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAG
PlacIQ1:  CATTTACGTTGACACCACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAG
PgalR:    GCCACCCCJJ^CAACGGGGGTmcCGTAACAOGAAAGAATGTAAGCG
PgalS:    TGACTCGATJ^AAGTCCTGTATTCAGTGCTGACAAAATAGCCCGCCAGC
PccpA:    AGTATACGTITJ^^TCATCTATAAAAACGTGIAJ^AITTCATGAGAAGTAAT  B.s.

Actual transcription start sites are in bold.
-10 and -35 sequences are underlined.
B.s. = Bacillus subtilis
K.p. = Klebsiella pneumoniae

Since stoichiometry is important, too much regulatory protein needs to be
avoided if possible. For example, if the strength of a promoter for a gene encoding a
regulatory protein is increased too much, then the abundance of the regulatory
proteins may be far greater than the number of sites to which they can bind, even on
a high copy number plasmid. In this case, it may require a large amount of inducer
to activate transcription of the heterologous gene(s), often an undesirable situation.

After the heterologous protein has been expressed to the desired level, it
generally should be released from the cell to create a crude protein solution. This
can usually be done by breaking open the cells, e.g., by sonication, homogenization,
enzymatic lysis or any other cell breakage technique known in the art. The proteins
can also be released from cells by dilution at a controlled rate with a hypotonic
buffer so that some contamination with cellular components can be avoided (Shorr
et al., US Patent 5,264,555). In addition, cells may be engineered to secrete the
protein of interest by methods known in the art.

After breakage of the cells, or secretion, the target protein is contained in a 
crude cell lysate or a crude cell broth or solution. The protein then may be purified 
according to methods well known in the art. For example, useful purification 
methods for hemoglobin-like proteins are taught in PCT Publication WO 95/14038 
incorporated herein by reference. Briefly, the methods described therein involve an
immobilized metal affinity chromatography resin charged with a divalent metal ion
such as zinc, followed by anion exchange chromatography. According to this
publication, the solution containing the desired Hb-containing material to be
purified can first be heat treated to remove Protoporphyrin IX-containing Hb. This
basic purification method can be further followed by a sizing column (S-200), then
another anion exchange column. Alternatively, this solution can be separated into
molecular weight fractions using ion exchange chromatography. The resulting
solution can then be buffer exchanged to the desired formulation buffer.

Appropriate recombinant host cells can be produced according to
conventional methods or as described in the Examples below. Any suitable host cell
can be used to express heterologous polypeptides. For example, E. coli cells are
particularly useful for expressing the heterologous polypeptides.

The proteins, so-produced, can be used for their known purposes. For
example, the hemoglobin-like proteins and compositions containing the globin-like
polypeptides or the multimeric hemoglobin-like proteins (collectively
"hemoglobins") can be used for in vitro or in vivo applications. Such in vitro
applications include, for example, the delivery of oxygen for the enhancement of cell
growth in cell culture by maintaining oxygen levels in vitro (DiSorbo and Reeves,
PCT publication WO 94/22482, herein incorporated by reference). Moreover, the
hemoglobin-like protein can be used to remove oxygen from solutions requiring the
removal of oxygen (Bonaventura and Bonaventura, US Patent 4,343,715,
incorporated herein by reference) and as reference standards for analytical assays
and instrumentation (Chiang, US Patent 5,320,965, incorporated herein by reference)
and other such in vitro applications known to those of skill in the art.

The hemoglobin-like proteins also can be formulated for use in therapeutic
applications. Example formulations are described in Milne, et al., WO 95/14038 and
Gerber et al., PCT/US95/10232, both herein incorporated by reference.

Pharmaceutical compositions comprising hemoglobin-like proteins can be
administered by, for example, subcutaneous, intravenous, or intramuscular
injection, topical or oral administration, large volume parenteral solutions useful as
blood substitutes, etc. Pharmaceutical compositions can be administered by any
conventional means such as by oral or aerosol administration, by transdermal or
mucous membrane adsorption, or by injection.

For example, hemoglobins can be used in compositions useful as substitutes
for red blood cells in any application where red blood cells are used, or for any
application in which oxygen delivery is desired. Such hemoglobins, formulated as
red blood cell substitutes, can be used for the treatment of hemorrhages, traumas
and surgeries where blood volume is lost and either fluid volume or oxygen
carrying capacity or both must be replaced.

[...] ordered [...] genome was [...] from a set of
[...] (1987). DNA [...] of known genes
have been [...] ordered [...] et al. 1992. [...]
sequences to a revised, integrated genomic restriction map, p. 2.3-2.43. In J. Miller,
A Short Course in Bacterial Genetics: a Laboratory Manual and Handbook for
Escherichia coli and Related Bacteria, Cold Spring Harbor Press, Cold Spring Harbor,
NY). The combination of known restriction site sequence with known gene
[...] PCR [...] has been used to
[...] upstream of the lac operon [...] PCR on two
primers, one of known sequence and one of random sequence. Because the
sequence of the flanking restriction sites is known in E. coli, a portion of the sequence
of the second, random PCR primer could be specified. The size of the expected
[...] predicted, depending upon the [...] primer
designed from the known DNA sequence.

The sequence upstream of the lacI gene in E. coli was determined because of
its potential role regulating the amount of Lac repressor synthesized.
The sequence of the lacI gene is known (Calos, 1978; Farabaugh, 1978) but the
published sequence ends just beyond the lacI promoter region. As part of the
process of making the expression system, a region from a wild type E. coli was
[...] to establish [...] could be designed
that would allow PCR amplification of lacI alleles from several strains. An anchored
[...] and primed toward the promoter region, and a second primer was
[...] promoter [...].
[...] primer, 12 nucleotides [...], contained a fixed half and a
random half. The 3' (priming) end contained 6 nucleotides from a restriction site
known to be upstream of PlacI, and the 5' 6 nucleotides were composed of equal
proportions of each nucleotide at each of the six positions. A product was obtained
and re-amplified by PCR using a second primer homologous to a different lacI
sequence from the first primer, but also containing a second site for subcloning into
pUC19. The insert was sequenced and contained the expected lacI sequence and 675
bp upstream of the lacI promoter region, which was not homologous to any
sequence in the nucleotide database. A large open reading frame, divergent from
lacI, was also observed. Using this sequence, a specific nucleotide primer was
designed, synthesized, and used to amplify PlacI from the E. coli chromosome of
several strains. PCR fragments of the expected size were observed from the wild
type control strains, and a fragment polymorphism was used to screen for a 15
basepair PlacI deletion in several strains, similar to lacIQ1.
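
The half-fixed, half-random primer described above can be sketched as follows. The 6-nucleotide restriction site used here (GGATCC) is a hypothetical placeholder, since the actual site upstream of PlacI is not named in this passage; the function names are likewise illustrative. Primers are written 5' to 3', so the random half comes first and the fixed site forms the 3' (priming) end.

```python
import random

BASES = "ACGT"
FIXED_3PRIME_SITE = "GGATCC"  # hypothetical placeholder for the restriction site
RANDOM_LENGTH = 6             # number of randomized 5' positions


def degenerate_primer(fixed_site: str, random_length: int = RANDOM_LENGTH) -> str:
    """Primer in degenerate notation: N at each random 5' position."""
    return "N" * random_length + fixed_site


def pool_size(random_length: int = RANDOM_LENGTH) -> int:
    """Number of distinct sequences in the primer pool."""
    return len(BASES) ** random_length


def sample_primer(fixed_site: str, rng: random.Random,
                  random_length: int = RANDOM_LENGTH) -> str:
    """Draw one primer, with each base equally likely at each random position."""
    random_half = "".join(rng.choice(BASES) for _ in range(random_length))
    return random_half + fixed_site


rng = random.Random(0)  # fixed seed so the example is reproducible
primer = sample_primer(FIXED_3PRIME_SITE, rng)

print(degenerate_primer(FIXED_3PRIME_SITE))
print(pool_size())
print(primer)
```

With six randomized positions the pool contains 4^6 = 4096 distinct primers, each sharing the same fixed 3' end.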

This technique may be used to close small sequencing gaps when a map is 
known, and quickly extend cloning and/or sequencing into mapped regions
adjacent to known sequence, from any DNA source. 

The following examples are provided by way of describing specific 
embodiments of the present invention without intending to limit the scope of the 
invention in any way. 

EXAMPLE 1 

Constructions of plasmids containing mutations in copy number: 
medium copy number plasmid; pSGE705 



Recombinant hemoglobin-like protein (rHb1.1, described in, for example,
PCT publication WO 90/13645, incorporated herein by reference) was produced by
fermentation of several E. coli strains containing modifications of the lac promoter
region and/or location. Construction of the parent E. coli strain 1661 carrying the
plasmid pSGE705 is described below. On January 20, 1994 E. coli strain SGE1661
was deposited with the American Type Culture Collection (ATCC Accession
Number 55545). Note that Strain SGE1661 carrying the plasmid pSGE705 was
denoted SGE1662 (described in PCT publication number WO 95/14038,
incorporated herein by reference). pSGE705 is a medium copy number plasmid
because it is present in approximately 100 copies per cell. The plasmids used in
preparing pSGE705 are identified in Table 1, which also provides a brief description
of each.

Materials. pBR322, pUC19 and pNEB193 were purchased from New England
Biolabs, Beverly, Massachusetts. Oligonucleotides were synthesized on an Applied
Biosystems DNA Synthesizer Model 392. The oligonucleotides used in preparing
pSGE705 are listed in Table 2. Restriction endonucleases were purchased from New
England Biolabs (Beverly, Massachusetts) and used according to manufacturer's
directions. T4 DNA Ligase was purchased from either New England Biolabs or
Gibco-BRL (Gaithersburg, Maryland) and used according to manufacturer's
directions. Pfu polymerase was purchased from Stratagene (La Jolla, California) and
used according to manufacturer's directions.

Media used to culture the strains are described in J. H. Miller, Experiments in
Molecular Genetics (Cold Spring Harbor Press 1972), and J. H. Miller, A Short Course
in Bacterial Genetics (Cold Spring Harbor Press 1992). Acridine orange, ampicillin
and kanamycin sulfate were purchased from Sigma Chemical Co. (St. Louis,
Missouri). [...] purchased from [...].



Table 1. Plasmids

PLASMID      DESCRIPTION

pSGE1.1E4    rHb1.1 expression plasmid containing di-alpha and beta globin
             genes and resistant to both ampicillin and tetracycline
pSGE1.1E5    like pSGE1.1E4 but ampicillin resistant only
pSGE490      pUC19 lacI on a BamHI-HindIII fragment
pSGE491      pUC19 alpha on an EcoRI-XbaI fragment
pSGE492      pNEB193 Ptac-alpha
pSGE493      pUC19 beta on an XbaI-HindIII fragment
pSGE500      pUC19 alpha beta on a BamHI-HindIII fragment
pSGE504      pSELECT-1 replace StyI with a PmeI site
pSGE505      pSGE504 rrnB T1 transcriptional terminator in the EcoRI-
             ClaI sites
pSGE507      ColE1 ori and tet, 2213 bp
pSGE509      ColE1 ori tet lacI, 3425 bp
pSGE513      ColE1 ori tet lacI alpha beta, 4386 bp
pSGE515      ColE1 ori tet lacI di-alpha beta, 4812 bp
pSGE700      pTZ18U + di-alpha beta from pSGE515
pSGE705      modified rHb1.1 expression plasmid, ColE1 ori, tet, lacI,
             di-alpha and beta genes
pTZ18U       a phagemid derivative of pUC19, for oligonucleotide
             directed mutagenesis
pDLH-91F     pGEM1 + alpha missing valine in 2nd position (Des-val)
pNEB193      like pUC19 but has more restriction sites in the multi
             cloning sites
pBR322       ColE1 ori tet amp
pRG1         pACYC177 lacIQ
pMLB1034     promoterless lacZ gene for promoter cloning, amp,
             pBR322 ori
pSGE654      constructed from pSGE705 from pSGE715. pBR322 ori, tet,
             di-alpha, no lacI gene on plasmid
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Table 2. Oligonucleotides 



OLIGO 



5 EV18 



SEQUENCE (X-y) 



CGGGAATACGGTCTAGATCATTAA 
CGGTATTTCGAAGTCAGAACG 



DESCRIPTION 

C-term of a gene, 
Xbal site 



10 



EV27 



GATCCGAGCTGTTGACAATTAAT 

CATCGGCTCGTATAATGTGT 

GGAATTGTGACGGATAACAATTT 

CACACAGGAAATTAATTAATGCT 

GTCTCC 



tac promoter 
BflmHI-sequence, 
Eagl sites 



EV28 



15 



20 



EV29 



EV30 



25 



GGCCGGAGACAGCATTAATTAAT 
TTCCTGTGTGAAATTGTTATCCGCTCAC 
AATTCCACACATTATACGAGCCGATGA 
TTAATTGTCAACAGCTCG 

TCGGATTCGAATTCCAAGCTGTTGG 

ATGCTTAGATTGAACTGTCTCCGGCCG 

ATAAAACCACCG 

CGGAAGCCCAATCTAGAGGAA 
ATAATATATGCACCTGACTCCG 
GAAGAAAAATCC 



tac promoter 
sequence, Bam HI- 
Eagl sites, 

complement of EV27 

5' end of a with 
EcoRI, BamHl and 
Eagl sites 

5' end of p with 
Xbal site 



EV31 
30 MW007 



35 MW008 



CCCGAAACCAAGCTTCATTAGTGA 
GCTAGCGCGTTAGCAACACC 

TTTAAGCTTCATTAGTGGTATT 
TGTGAGCTAGCGCGT 



CAGCATTAATTAACCTCCTTA 
GTGAAATTGTTATCCG 



3' end of the p gene 
with Hindlll site 

mutagenesis reverse 
primer, replaces last 
three codons of p 
missing in pSGE515 

mutagenesis reverse 
primer to optimize a 
ribozyme binding 
site (RBS) 
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(Table 2 continued) 



MW009 



TG14 
10 TG59 



15 



20 



25 



30 



TG60 



TG62 



TG63 



GGTGCATATATTTACCTCCTT 
ATCTAGATCATTAACGGTATTTCG 



GGTTTAAACC 

GGCGAATAAAAGCTTGCGGCCGCG 

TTGACACCATCGAATGGCGCAAAA 
CC1TTCGCGG- 



GGGCAAATAGGATCCAAAAAAAAG 
CCCGCTCATTAGGCGGGCTTTAT 
CACTG CCCGCTTTCCAGTCGGG 



CCCCGAAAAGGATCCAAGTA 
GCCGGCGGCCGCGTTCCACTG 
AGCGTCAGACCCC 



GGCGGTCCTGTTTAAACGCT 
GCGCTCGGTCGTTCGGCTGCGG 



mutagenesis reverse 
primer to optimize p 
RBS and remove 
second BgRl site 

Pmel linker 

Upstream of lad 
gene, has a Hindlll 
and a Notl site 
upstream of the 
promoter 

Downstream side of 
lacl gene with trp trans- 
scriptional terminator 
and a BarriHl site 

upstream primer for 
pBR322 ori positions 
3170-3148 with 
BamHl and a Notl site 

downstream primer 
for pBR322 ori 
positions 2380-2404 
with a Pmel site 



Genetic and Molecular Biological Procedures. Standard bacterial genetic
procedures are described in J. H. Miller, Experiments in Molecular Genetics (Cold
Spring Harbor Press, 1972) and J. H. Miller, A Short Course in Bacterial Genetics (Cold
Spring Harbor Press, 1992). Standard molecular biology procedures were
performed as described in Sambrook et al., Molecular Cloning (Cold Spring Harbor
Press, 1989).



Plasmid DNA Transformation. DNA transformations were performed by the
procedure described in Wensink et al., Cell 3: 315-325 (1974). Briefly, cells were
pelleted, resuspended in an equal volume of 10 mM MgSO4 and incubated on ice for
30 minutes. The cells were centrifuged and the pellet resuspended in 1/2 the
original volume of 50 mM CaCl2 and placed on ice for 20 minutes. The cells were
centrifuged again and then resuspended in 1/10 the original volume of 50 mM
CaCl2. Plasmid DNA was added to the competent cells in a solution of 10 mM
Tris-HCl, pH 8.0, 10 mM MgCl2 and 10 mM CaCl2. The mixture was incubated on
ice for 15 minutes and then incubated at 37°C for 5 minutes. One milliliter of LB
medium was added and the mixture incubated with shaking for 30-60 minutes.
The culture was then centrifuged, resuspended in 0.1 ml of LB medium and plated
on the appropriate selective medium.

Purification of DNA. DNA fragments were purified from an agarose gel
using the Geneclean system (Bio 101, Inc., La Jolla, CA) according to the method
provided with the product. PCR products were prepared and cleaved with restriction
endonucleases using the Double Geneclean system (Bio 101, Inc., La Jolla; method
provided with the product). Briefly, the PCR product was purified away from the PCR
primers, then the PCR product was cleaved with restriction endonuclease(s) and
purified from the restriction endonuclease and buffer. The PCR product was then
ready for a ligation reaction.

Annealing of oligonucleotides. Complementary oligonucleotides were
annealed according to the following procedure. Equimolar amounts of each
oligonucleotide were mixed in 15-25 µl of 10 mM Tris-HCl, pH 8.0/1 mM EDTA and
incubated at 65°C for 30 minutes. The sample was transferred to a 37°C water bath
for 30 minutes. Finally, the sample was incubated on ice for 60 minutes or in the
refrigerator overnight.

Oligonucleotide directed mutagenesis. Oligonucleotide directed mutagenesis
was performed with the Muta-gene phagemid in vitro mutagenesis kit (Bio-Rad,
Hercules, California) according to the manufacturer's instructions, which are based
on the method of Kunkel (Kunkel, T. A. (1985) Proc. Natl. Acad. Sci. USA 82: 488;
Kunkel et al., (1987) Methods Enzymol. 154: 367). The rHb1.1 region of pSGE515
was cloned into pTZ18U (Bio-Rad, Hercules, CA or U.S. Biochemical, Cleveland,
OH) on a BamHI-HindIII fragment to create pSGE700. Three oligonucleotides,
MW007, MW008 and MW009, were used to simultaneously introduce multiple
changes in a single reaction.

Preparation of pBR322 ori. PCR primers were designed to amplify the
pBR322 origin of replication. These primers, TG62 and TG63, annealed to
positions 3170-3148 and 2380-2404, respectively, on the pBR322 DNA sequence
(Sutcliffe, J. G. (1979) Cold Spring Harbor Symp. Quant. Biol. 43: 77-90). The PCR
product was digested with NotI and PmeI. The DNA fragment was purified
according to the Geneclean procedure.

Preparation of tet gene fragment. The source for the tet gene was pSELECT-1
(Promega Corp., Madison, WI). This plasmid has a number of restriction
endonuclease sites, such as BamHI, HindIII, SalI and SphI, removed from the tet gene
(Lewis and Thompson (1990) Nucleic Acids Res. 18:3439-3443). A PmeI linker was
inserted into the StyI site of pSELECT-1. This plasmid was designated pSGE504.
Oligonucleotides TG71: AAT TCG CGG CCG CAT TCT CGA GCG GAT CCC TCC
AGC CAA GCT TAA ATA AAA CGA AAG GCT CAG TCG AAA GAC TGG GCC
TTT CGT TTT ATC TGT TGA T and TG72: CGA TCA ACA GAT AAA ACG AAA
GGC CCA GTC TTT CGA CTG AGC CTT TCG TTT TAT TTA AGC TTG GCT GGA
GGG ATC CGC TCG AGA ATG CGG CCG CG were annealed and ligated to the
EcoRI-ClaI fragment of pSGE504. The resulting plasmid, pSGE505, was shown to
have the expected restriction endonuclease sites and to have lost the sites present in
the multicloning site of pSELECT-1. pSGE505 was digested with NotI and PmeI. The
1417 bp fragment was purified according to the Geneclean protocol.

Preparation of lacI gene. The lacI gene was isolated by amplifying the gene
sequence from pRG1 (Dana-Farber Cancer Inst., Boston) that carried the lacI gene.
The PCR primers, TG59 and TG60, were designed to generate a wild type lacI
promoter (Farabaugh, P. J. (1978) Nature 274:765) upstream of the gene and to place
the trp terminator sequence (Christie et al. (1981) Proc. Natl. Acad. Sci. USA 78:4180-
4184) downstream of the gene. The same step could be carried out using Y1089
(Promega) or chromosomal DNA from any E. coli strain carrying the lac region, such
as MM294 (ATCC 33625). The PCR product was gel purified and isolated according
to the Geneclean procedure and cloned into BamHI-HindIII digested pUC19 DNA to
make pSGE490.

Construction of pSGE515. PCR primers EV29 and EV18 were chosen to
amplify the alpha globin gene from pDLII-91F (Hoffman et al., WO 90/13645). The
purified PCR product was cleaved with the restriction endonucleases EagI and XbaI.

To create a plasmid that contained Ptac-α, the alpha globin gene (from above)
and the tac promoter, which was prepared by annealing EV27 and EV28, were
mixed with EcoRI-XbaI cleaved pUC19 DNA. The mixture of the three DNA
fragments, in approximately equimolar ratio, was treated with T4 DNA ligase.
After incubation the ligation mixture was used to transform SGE476, and ampicillin
resistant transformants were selected. (Transformation into strain MM294 (ATCC
33625) yields equivalent results.) An isolate with the correct restriction
endonuclease fragments (consistent with Figure 1) was designated pSGE492. The α
gene and the tac promoter DNA sequences were verified by DNA sequencing.

Primers EV30 and EV31 were used to amplify the β globin gene from
pSGE1.1E4 by PCR. The purified β gene fragment was digested with XbaI and
HindIII and then mixed with XbaI-HindIII digested pUC19 DNA and treated with T4
DNA ligase. The ligation mixture was used to transform competent SGE476
(equivalent to MM294, ATCC 33625) and transformants were selected on LB +
ampicillin (100 µg/ml) plates. An isolate that contained the appropriate restriction
endonuclease fragments (consistent with Figure 1) was chosen and designated
pSGE493. The β gene was confirmed by DNA sequencing.

The β gene was isolated from pSGE493 by restriction with XbaI and HindIII
followed by purification according to the Geneclean method. This DNA fragment
was then ligated to XbaI-HindIII restricted pSGE492 DNA and transformed into
SGE713. (Any dam- strain such as JM110 (ATCC 47013) or GM119 (ATCC 53339)
could also be used.) An ampicillin resistant transformant that carried a plasmid that
had the appropriate restriction fragments (consistent with Figure 1) was chosen and
designated pUC19αβ (pSGE500).

The BamHI-HindIII fragment that contained the α and β globin genes of
pSGE500 was purified according to the Geneclean method. An XhoI fragment that
carried a portion of the di-α gene containing the glycine linker region was gel
purified from pSGE1.1E5. pSGE1.1E5 (described in Hoffman et al., United States
Serial Number 789,179, filed November 8, 1991) is a tetracycline sensitive analogue
of pSGE1.1E4 (Hoffman et al., WO 90/13645), which could also have been used.

The pBR322 origin of replication region (pBR322 ori, above) was ligated to the 
tet gene fragment (above) and the ligation mixture was transformed into SGE476. 
(Transformation into MM294, above, would yield equivalent results.) Tetracycline 
resistant transformants were selected and plasmid DNA was isolated and analyzed. 
An isolate that contained the appropriate restriction endonuclease fragments was 
chosen and designated pSGE507. 

Next, pSGE507 and pSGE490 were digested with BamHI and NotI and the
appropriate fragments were purified. The two purified fragments were ligated
together and the ligation mixture was used to transform competent SGE713. (Any
dam- strain could also be used; see above.) Tetracycline resistant transformants were
selected, and plasmid DNA was isolated and analyzed. A plasmid that had the
appropriate restriction fragments was chosen and designated pSGE509.

The purified BamHI-HindIII fragment of pSGE500 that contained the α and β
globin genes was ligated to BamHI-HindIII digested pSGE509. The ligation mixture
was used to transform SGE713 (see above for equivalent strains) and tetracycline
resistant transformants were selected and characterized. An isolate yielding the
correct size plasmid with the expected restriction endonuclease fragments was
chosen and designated pSGE513.

The XhoI fragment of pSGE1.1E5 (described in Hoffman et al., United States
Serial Number 07/789,179, filed November 8, 1991, incorporated herein by
reference) that contained the di-α glycine linker sequence was ligated to XhoI
digested pSGE513 to create a plasmid that contained the di-α gene. SGE753 was
transformed with the ligation mixture and tetracycline resistant transformants were
selected. (Transformation into SGE800 would have yielded equivalent results.)
Isolates were screened to identify those that contained the XhoI fragment inserted
into pSGE513 in the correct orientation. An isolate that contained the correct
configuration of the di-α gene, as determined by restriction endonuclease analysis
with EagI, was designated pSGE515.
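The plasmid sizes listed in Table 1 allow a simple bookkeeping check of the construction series above. The following is a minimal sketch, not part of the original procedure: it only subtracts the listed sizes to show the net change in length at each step, and it assumes that pSGE507 consists of nothing but the 1417 bp tet fragment and the origin fragment. The function and variable names are illustrative.

```python
# Plasmid sizes in base pairs, as listed in Table 1.
PLASMID_SIZES_BP = {
    "pSGE507": 2213,  # ColE1 ori and tet
    "pSGE509": 3425,  # adds lacI
    "pSGE513": 4386,  # adds the alpha and beta globin genes
    "pSGE515": 4812,  # adds the XhoI fragment carrying the di-alpha linker
}

# Size of the NotI-PmeI tet fragment purified from pSGE505 (from the text).
TET_FRAGMENT_BP = 1417


def net_size_changes(sizes):
    """Return the net change in length between consecutive plasmids."""
    names = list(sizes)
    return {
        f"{a} -> {b}": sizes[b] - sizes[a]
        for a, b in zip(names, names[1:])
    }


changes = net_size_changes(PLASMID_SIZES_BP)
for step, delta in changes.items():
    print(f"{step}: {delta:+d} bp")

# Assumption: pSGE507 = tet fragment + origin fragment, nothing else.
ori_fragment_bp = PLASMID_SIZES_BP["pSGE507"] - TET_FRAGMENT_BP
print(f"implied origin fragment: {ori_fragment_bp} bp")
```

The implied origin fragment is a few base pairs longer than the 791 bp spanned by pBR322 positions 2380-3170, which would be expected if part of each primer tail remains after NotI and PmeI digestion.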

Modification of pSGE515 to create pSGE705. The DNA sequence record used to
design PCR primers for the amplification of the β gene did not contain the C-
terminal three amino acids. Oligonucleotide directed mutagenesis was used to add
these nine nucleotides to the DNA sequence of the β gene. In the same reactions,
modifications were introduced to optimize the ribosome binding sites for the di-α
and β genes, and to remove a BglII site near the end of the di-α gene. The HindIII-
BamHI fragment from pSGE515 was subcloned into pTZ18U, creating pSGE700.
pSGE700 was then used as a source of ssDNA for site-directed mutagenesis.

The following are the changes that were made with the oligonucleotides
MW008 and MW009 to optimize ribosomal binding sites and to remove a BglII
restriction endonuclease site.

di alpha

before - CAATTTCAC--AGGAAATTAATTAATGCTG
         |||||||||**||||**|||||||||||||
after  - CAATTTCACTAAGGAGGTTAATTAATGCTG

Four nucleotide changes, shown above, including the insertion of two nucleotides,
were introduced with MW008 to optimize the ribosome binding site for di-alpha. ( |
- indicates identity, * - indicates a change)

beta

before - TAAaGATCTAGA---GGAAATAA-TATATGCAC
         |||*||||||||***|||**|||*|||||||||
after  - TAATGATCTAGATAAGGAGGTAAATATATGCAC

The six nucleotide changes shown above, including the insertion of four nucleotides,
were introduced with MW009 to optimize the ribosome binding site for beta. The
lower case "a" on the before strand was a T to A mutation in the construction of the
alpha gene that introduced a BglII site into the sequence. This was removed so that
there would only be a single BglII site in pSGE705. ( | - indicates identity, * -
indicates a change)

End of Beta

before - CTCGCTCAC---------TAATGAA
         |||||||||*********|||||||
after  - CTCGCTCACAAATACCACTAATGAA

MW007 introduced the coding sequence for the last three amino acids of the beta
gene as shown above. ( | - indicates identity, * - indicates a change)
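The identity/change marker lines in the three comparisons above can be regenerated mechanically from the aligned sequences. The sketch below is illustrative only: the placement of the gap characters ("-") in the "before" strands is an assumption made so that the two strands have equal length, and the function name is hypothetical.

```python
def compare_aligned(before, after):
    """Compare two aligned sequences of equal length.

    Returns (marker_line, insertions, substitutions), where the marker line
    uses '|' for identity and '*' for a change, and '-' in the 'before'
    strand denotes an inserted nucleotide.
    """
    if len(before) != len(after):
        raise ValueError("aligned sequences must have equal length")
    marks = []
    insertions = 0
    substitutions = 0
    for b, a in zip(before.upper(), after.upper()):
        if b == "-":
            insertions += 1
            marks.append("*")
        elif b == a:
            marks.append("|")
        else:
            substitutions += 1
            marks.append("*")
    return "".join(marks), insertions, substitutions


# Aligned before/after pairs; gap positions are assumed, see lead-in.
SITES = {
    "di-alpha RBS": ("CAATTTCAC--AGGAAATTAATTAATGCTG",
                     "CAATTTCACTAAGGAGGTTAATTAATGCTG"),
    "beta RBS": ("TAAaGATCTAGA---GGAAATAA-TATATGCAC",
                 "TAATGATCTAGATAAGGAGGTAAATATATGCAC"),
    "end of beta": ("CTCGCTCAC---------TAATGAA",
                    "CTCGCTCACAAATACCACTAATGAA"),
}

results = {name: compare_aligned(b, a) for name, (b, a) in SITES.items()}
for name, (marks, ins, subs) in results.items():
    print(f"{name}: {marks}  insertions={ins} substitutions={subs}")
```

For the beta site the count includes the lower case "a" position, which is the change that removes the BglII site rather than one of the ribosome binding site changes.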

Putative mutants were screened for loss of a BglII restriction endonuclease
cleavage site (introduced by MW009). Seventeen of 24 had lost the site and were
further characterized by DNA sequencing at the other two mutagenized sites. One
of the 17 had incorporated all the modifications from the three oligonucleotides.
These changes were verified by DNA sequencing and the rHb1.1 genes were cloned
into BamHI-HindIII digested pSGE509. An isolate that had the correct restriction
endonuclease fragments was designated pSGE705.
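Because MW007, MW008 and MW009 are reverse primers, their reverse complements should contain the "after" sequences shown above. The following sketch checks this using the primer sequences from Table 2. It is an illustration, not part of the original method. For MW007 only the segment spanning the nine inserted nucleotides is compared, because the listed primer and the "End of Beta" comparison differ at one position upstream of the insertion, which may be an extraction artifact.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Mutagenic primers as listed in Table 2 (5'-3').
PRIMERS = {
    "MW007": "TTTAAGCTTCATTAGTGGTATTTGTGAGCTAGCGCGT",
    "MW008": "CAGCATTAATTAACCTCCTTAGTGAAATTGTTATCCG",
    "MW009": "GGTGCATATATTTACCTCCTTATCTAGATCATTAACGGTATTTCG",
}

# Top-strand sequences expected after mutagenesis.
EXPECTED = {
    "MW007": "CACAAATACCACTAATGAA",  # segment spanning the 9 nt insertion
    "MW008": "CAATTTCACTAAGGAGGTTAATTAATGCTG",
    "MW009": "TAATGATCTAGATAAGGAGGTAAATATATGCAC",
}

BGLII_SITE = "AGATCT"

matches = {
    name: EXPECTED[name] in reverse_complement(seq)
    for name, seq in PRIMERS.items()
}
mw009_top = reverse_complement(PRIMERS["MW009"])
bglii_removed = BGLII_SITE not in mw009_top

for name, ok in matches.items():
    print(f"{name}: encodes expected sequence = {ok}")
print(f"MW009 product lacks BglII site: {bglii_removed}")
```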

A plasmid map of pSGE705 is shown in Figure 1. The plasmid map indicates
many of the restriction endonuclease cleavage sites. pSGE705 is smaller than its
counterpart pSGE1.1E4, and the placement of its restriction sites facilitates modular
alterations of the sequence. An unused antibiotic resistance marker was removed,
and a promoter was added to the lacI gene that would allow tighter control of
rHb1.1 expression.

A new sequence upstream of the α gene minimized the distance between the
tac promoter (De Boer et al., Proc. Natl. Acad. Sci. USA 80, 21-25, 1983) and the first
codon of the alpha gene. The intergenic region between the di-α gene and the β
gene was also designed to contain the minimum sequence that contained a
restriction endonuclease site and the ribosome binding site for the β gene.

Construction of plasmids containing mutations in copy number:
High Copy Number Plasmid, pSGE720

The construction of pSGE720 was performed in two stages. First, the pUC
origin of replication was introduced into pSGE705 to create plasmid pSGE715, which
is similar to pSGE705 in that it includes the lacI gene. Then, the lacI gene was deleted
from the plasmid and replaced with a short oligonucleotide containing several
convenient restriction sites to create plasmid pSGE720.

A. Construction of pSGE715

The pUC origin of replication was introduced to create plasmid pSGE715
through pSGE508, which is identical to pSGE509 with the exception of a single
basepair substitution at base 602 (G->A). The substitution changes the pBR322
origin of replication to a pUC19 origin of replication.

Plasmids pSGE508 and pSGE705 were digested to completion with restriction
enzymes BamHI and HindIII, according to the manufacturer's instructions (New
England Biolabs). The plasmid, pSGE508, was digested first with BamHI to
completion, then HindIII was added, and the digestion continued. The pSGE705
digest was purified with Promega Magic DNA Clean-up protocols and reagents
(Promega, Madison, WI) and further digested to completion with BglI, according to
the manufacturer's instructions (New England Biolabs). The enzymes in both the
pSGE508 and pSGE705 digests were inactivated by heating at 67°C for 10 minutes,
then the DNA was pooled and purified together using Promega Magic DNA Clean-
up protocols and reagents. The DNA was suspended in ligation buffer, T4 DNA
ligase was added to one aliquot, and the DNA was incubated overnight at 16°C.
SGE1661 cells were made competent by the method of Hanahan, using rubidium
chloride (Hanahan, D., In DNA Cloning: A Practical Approach (Glover, D. M., ed.) vol.
1, pp. 109-135, IRL Press, Oxford, 1985), and transformed with the ligation mix
according to the Hanahan protocol. Transformants were selected by plating the cells
on LB plates containing 15 µg/ml tetracycline. Candidates were screened by
restriction digestion to determine the presence of the rHb1.1 genes, and sequencing
to detect the pUC origin of replication. Several candidates were identified, and the
resulting plasmid was named pSGE715. pSGE715 in SGE1661 was called SGE1453.
The copy number of pSGE715 is about four-fold higher than that of pSGE705.
As noted above, the difference between pSGE705 and pSGE715 is a single basepair
change in the origin of replication region, which has been confirmed by sequencing.

B. Construction of pSGE720

The lacI gene was deleted from pSGE715, replacing it with a short
oligonucleotide containing several convenient restriction sites, by the following
steps. First, plasmid pSGE715 was digested to completion with restriction enzymes
BamHI and NotI, according to the manufacturer's instructions (New England
Biolabs). The pSGE715 digest was purified with Promega Magic DNA Clean-up
protocols and reagents. The DNA was mixed with annealed, kinased
oligonucleotides, CBG17 + CBG18, and suspended in ligation buffer.

CBG17 = 5'-GGCCGCCTTAAGTACCCGGGTTTCTGCAGAAAGCCCGCCTA 
ATGAGCGGGCTTTTTTTTCCTTAGGG-3' 

CBG18 = 5'-GATCCCCTAAGGAAAAAAAAGCCCGCTCATTAGGCGGGCTTT 
CTGCAGAAACCCGGGTACTTAAGGC-3' 
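The design of the CBG17/CBG18 linker can be checked with a short script. The sketch below is an illustration under stated assumptions, not part of the original procedure: it assumes that the first four nucleotides of each oligonucleotide form the single-stranded overhang (GGCC for the NotI end, GATC for the BamHI end) and that the remaining nucleotides pair to form the duplex. It also looks for the PstI (CTGCAG) and SmaI (CCCGGG) recognition sequences used later for screening.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Linker oligonucleotides as given in the text (5'-3').
CBG17 = ("GGCCGCCTTAAGTACCCGGGTTTCTGCAGAAAGCCCGCCTA"
         "ATGAGCGGGCTTTTTTTTCCTTAGGG")
CBG18 = ("GATCCCCTAAGGAAAAAAAAGCCCGCTCATTAGGCGGGCTTT"
         "CTGCAGAAACCCGGGTACTTAAGGC")

OVERHANG_LEN = 4  # assumed length of each 5' single-stranded overhang

# 5' overhangs left single-stranded after annealing.
noti_end = CBG17[:OVERHANG_LEN]
bamhi_end = CBG18[:OVERHANG_LEN]

# The remaining nucleotides should be exact reverse complements.
duplex_ok = CBG17[OVERHANG_LEN:] == reverse_complement(CBG18[OVERHANG_LEN:])

# Recognition sequences used to screen for the new linker.
has_psti = "CTGCAG" in CBG17
has_smai = "CCCGGG" in CBG17

print(f"overhangs: {noti_end} / {bamhi_end}")
print(f"duplex region complementary: {duplex_ok}")
print(f"PstI site present: {has_psti}, SmaI site present: {has_smai}")
```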



T4 DNA ligase was added to one aliquot, and the DNA was incubated
overnight at 16°C. SGE1821 cells (SGE1661 + pRG1 (pACYC177 with lacIQ) (Dana-
Farber Cancer Institute, Boston, MA)) were made competent by the method of
Hanahan, using rubidium chloride, and transformed with the ligation mix
according to the Hanahan protocol. SGE1821 contains pRG1 plasmids in addition to
pSGE720. pRG1 is a low copy number plasmid containing lacIQ. Transformants
were selected by plating the cells on LB plates containing 15 µg/ml tetracycline.
Candidates were screened by restriction digestion using PstI and SmaI to detect the
presence of the new linker and the absence of the lacI gene, and sequenced to detect
the pUC origin of replication and the absence of the lacI gene. Several candidates
were identified, and the resulting plasmid was named pSGE720. The plasmid
pSGE720 in SGE1675 is called SGE1464.



EXAMPLE 3 

Construction of Strains containing Various lacI Chromosomal Alleles:
SGE1661, SGE1670 and SGE1675



Strains SGE1494 and SGE1495, which contained lacIQ on the chromosome,
were purchased from ATCC (ATCC accession numbers 47041 and 47043,
respectively). SGE77 was purchased from Stratagene, Inc. (Catalogue number
D1210) and also contained a lacIQ on an F' episome.

Strain SGE1661 contained a wild type chromosomal lacI allele. This strain
was used as the starting material for the construction of strains containing
alternative chromosomal lacI alleles.

To examine the effect of increasing the strength of the chromosomal lacI
promoter, strain SGE1670 was constructed. Strain SGE1670 containing the lacIQ1
allele was constructed from SGE1661 by P1 bacteriophage transduction using a
lysate grown on SGE299, selecting for kanamycin resistant transductants. Strain
SGE299 contained a putative lacIQ allele on an F' episome adjacent to a lacZ gene into
which a Tn5 transposon has been inserted to inactivate lacZ. The transposon
insertion conferred kanamycin resistance to the cells. SGE299 is also known as
AG1688 or RDK2759 and is described by Hu et al. (Protein Science, 2: 1072-1084,
1993). Alternatively, one skilled in the art could use any strain having a lacI or lacIQ
allele and mutate the sequence to yield lacIQ1 using the sequence of Calos & Miller,
Mol. Gen. Genet. 183:559-560 (1981) as a guide. PCR analysis of SGE299 (described
below) demonstrated that the putative lacIQ was lacIQ1, or a variant thereof.

In strain SGE1675, the lac operon functionality on the chromosome was
restored without affecting the lacI allele. The presence of the transposon insertion in
lacZ in SGE1670 resulted in a polar mutation that destroyed the function of lacY and
lacA, involved in lactose metabolism and transport, which may have affected the
ability to induce expression of Ptac on the plasmid with IPTG, and thus may have
affected beta-galactosidase or hemoglobin production from a given plasmid.
SGE1675 was constructed by transduction of SGE1670 from a P1 lysate prepared on
SGE765. SGE765 was a strain containing a Tn5 insertion into the lacI gene conferring
kanamycin resistance to the cells. In addition, SGE765 contained a wild type copy of
lacZ. Strain SGE765 was made by P1 transduction from a lysate made on MS24 into
strain C3000. MS24 is also known as MG1655 lacI3098::Tn10kan lacZA118, and is
described in Singer et al. (Microbiol. Rev. 53: 1-24, 1989). Strain C3000 is available
from the American Type Culture Collection, ATCC # 15597.

Transductants were selected for their ability to grow on minimal medium
containing lactose as the sole carbon source and screened for sensitivity to
kanamycin. Transductants that were sensitive to kanamycin and were lac+ were
screened for their ability to regulate (repress) the expression of beta-galactosidase
from plasmid pSGE714 (see below). Note: pSGE714 does not contain a lacI on the
plasmid and was examined for repression of lacZ as described for strain SGE1670.

EXAMPLE 4

Medium Copy Number LacZ Fusion Plasmid containing lacI: pSGE712

pSGE712, containing the lacI gene and a fusion of the lacZ gene to the 5' end of
the beta globin gene, was constructed as described below. This construct provided a
convenient tool for screening control of expression as a function of beta-
galactosidase activity in the cells. Thus, in the absence of inducer, the expression of
beta-galactosidase provided a sensitive measure of control of expression by the
product of the lac repressor gene expressed primarily from the plasmid.

pSGE705 was digested with SalI and BglII. pMLB1034 (obtained from Sankar
Adhya, NIH, and described in Miller, G. H., Experiments in Gene Fusions, (1984),
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY) was digested with
BamHI and SalI. The two digests were cleaned using a Promega Magic DNA clean-
up column using the manufacturer's recommended procedures, and the DNA was
ligated using T4 DNA ligase. The ligation mix was used to transform JM109
competent cells. Candidates were ampicillin resistant and blue colonies on LB amp
plates containing X-gal and were screened by BamHI/HindIII digestion for the
appropriate plasmid conformation. Correct candidates contained a BamHI-HindIII
fragment almost 3 kilobase pairs longer than the original plasmid, indicating the
presence of the lacZ gene on the plasmid.

EXAMPLE 5 

Construction of a Medium Copy Number LacZ Fusion Plasmid without lacI: pSGE714

pSGE714, containing a fusion of the lacZ gene to the 5' end of the beta globin
gene, was constructed as described in Example 4, except that pSGE654 was used
instead of pSGE705 as the starting material. Note that unlike pSGE712, pSGE714 did
not contain lacI. This construct provided a convenient tool for screening control of
expression as a function of beta-galactosidase activity in the cells. Thus, in the
absence of inducer, the expression of beta-galactosidase provided a sensitive
measure of control of expression by the product of the lac repressor gene expressed
from the chromosome of the cell.

EXAMPLE 6

Construction of a High Copy Number LacZ Fusion Plasmid without lacI: pSGE721

pSGE721, containing a fusion of the lacZ gene to the 5' end of the beta globin
gene, was constructed as described in Example 4, except that pSGE720 was used
instead of pSGE705 as the starting material. This construct provided a convenient
tool for screening control of expression as a function of beta-galactosidase activity in
the cells. Thus, in the absence of inducer, the expression of beta-galactosidase
provided a sensitive measure of control of expression by the product of the lac
repressor gene expressed from the chromosome of the cell. This plasmid results in
approximately 500 copies of the plasmid per cell. In order to control the expression
of beta-galactosidase from these plasmid copies, more repressor must be expressed
from the chromosomal copy of lacI per cell than there are plasmids per cell.

EXAMPLE 7 

Construction of a high copy number plasmid containing a single alpha globin
gene and wild type beta gene: pSGE728

The construction of pSGE728 was performed by digesting plasmid pSGE720
with an enzyme that cuts only within each of the two alpha subunits of the di-alpha
gene encoding the globin-like protein, followed by ligation, to preferentially
reconstruct deletions of one alpha subunit and the di-alpha glycine linker. The
resulting plasmid, pSGE726, contains a single alpha gene rather than a di-alpha
gene. The Presbyterian mutation of human hemoglobin in the beta globin gene was
replaced by a second digestion and ligation that introduced the wild-type beta and
created pSGE728.

A. Construction of pSGE726

pSGE720 was digested with restriction enzyme XhoI, according to the
manufacturer's instructions (New England Biolabs). The pSGE720 digest was
purified with Promega Magic DNA Clean-up protocols and reagents (Promega,
Madison, WI) and suspended in ligation buffer. T4 DNA ligase was added and the
DNA was incubated overnight at 16°C. SGE1675 cells were made competent by the
method of Hanahan (Hanahan, D., ibid), and transformed with the ligation mixture
according to the referenced protocol. Transformants were selected by plating the
cells on LB plates containing 15 µg/ml tetracycline. Candidates were screened by
restriction digestion to determine the presence of a mono-alpha globin gene, rather
than the di-alpha globin gene present in pSGE720. Several candidates were
identified, and the resulting plasmid was named pSGE726. pSGE726 in SGE1675 was
called SGE1480. This plasmid expressed the hemoglobin-like molecule rHb1.0, mono-
alpha plus beta Presbyterian.

B. Construction of pSGE728

pSGE726 and pSGE0.0E4 (Hoffman et al., 1990. Proc. Natl. Acad. Sci. USA.
87:8521-8525 and Looker et al., 1992. Nature, 356:258-260) were digested with
restriction enzymes BglII and HindIII, according to the manufacturer's instructions
(New England Biolabs). The digests were loaded into wells in a 1% agarose gel, and
subjected to electrophoresis in Tris acetate buffer as described in Maniatis et al.
(Maniatis, T., E. F. Fritsch, and J. Sambrook. 1982. Molecular Cloning. Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY). The small DNA band of about 300
basepairs (bp) from the pSGE0.0E4 digest, and the large band of about 3,400 bp from
the pSGE726 digest, were purified by excising them from the agarose gel, pooling
them into a 1.7 ml tube and purifying them with GeneClean Kit protocols and
reagents (BIO 101, La Jolla, CA). Purified, pooled DNA fragments were suspended
in ligation buffer. T4 DNA ligase was added and the DNA was incubated overnight
at 16°C. SGE1675 cells were made competent by the method of Hanahan (Hanahan,
D., ibid), and transformed with the ligation mixture according to the referenced
protocol. Transformants were selected by plating the cells on LB plates containing
15 µg/ml tetracycline. Candidates were screened by restriction digestion with ScaI
according to the manufacturer's instructions (New England Biolabs), to determine
the presence of a wild type beta globin gene, which is not susceptible to ScaI
digestion, rather than the beta Presbyterian gene, which is cleaved by ScaI. Several
candidates were identified. The resulting plasmid was named pSGE728. pSGE728
in SGE1675 was called SGE1483. This plasmid expressed rHb0.0, mono-alpha plus
wild type beta.

EXAMPLE 8 

Plasmid Copy Number Measurement 

Plasmid copy number was determined by comparison of the band intensity of
linearized plasmids to that of known quantities of HindIII-digested lambda DNA
(New England Biolabs) on ethidium bromide-stained TAE agarose gels. Plasmid
DNA was extracted from bacterial cultures using the Wizard Miniprep Kit
(Promega). The OD600 of these cultures was determined before plasmid
purification. Using the conversion factor of 8 x 10^8 cells/OD·ml, the OD600, the
culture volume used, and the mass of DNA determined by comparison to the
standard, the copy number was calculated.
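The calculation described above can be sketched as follows. This is an illustrative reconstruction, not the original worksheet: the average mass of 650 g/mol per base pair of double-stranded DNA and the default of 8e8 cells per OD unit per ml are assumptions (substitute the conversion factor actually used), the function name is hypothetical, and the example inputs are invented for demonstration only.

```python
AVOGADRO = 6.02214076e23       # molecules per mole
BP_MASS_G_PER_MOL = 650.0      # assumed average mass of one dsDNA base pair


def plasmid_copy_number(dna_mass_ng, plasmid_size_bp, od600,
                        culture_volume_ml, cells_per_od_ml=8e8):
    """Estimate plasmid copies per cell.

    dna_mass_ng       -- plasmid DNA mass recovered, from the gel standard
    plasmid_size_bp   -- plasmid length in base pairs
    od600             -- optical density of the culture at 600 nm
    culture_volume_ml -- culture volume used for the plasmid preparation
    cells_per_od_ml   -- assumed cells per ml per OD600 unit
    """
    moles = dna_mass_ng * 1e-9 / (plasmid_size_bp * BP_MASS_G_PER_MOL)
    molecules = moles * AVOGADRO
    cells = cells_per_od_ml * od600 * culture_volume_ml
    return molecules / cells


# Hypothetical example: 100 ng of a 5000 bp plasmid from 1 ml at OD600 = 1.0.
example = plasmid_copy_number(100, 5000, 1.0, 1.0)
print(f"estimated copies per cell: {example:.1f}")
```

The estimate assumes complete plasmid recovery; losses during the miniprep would lead to an underestimate of the copy number.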



EXAMPLE 9 

Measurement of beta-galactosidase expression 

Beta-galactosidase was measured using the technique described in Miller.
Cells were grown with agitation in supplemented minimal medium (M63 +
casamino acids) at 37 or 30°C in 25 ml flasks or 20 ml test tubes. Expression was
induced by the addition of various concentrations of IPTG to the cells in mid-log
phase (OD 600 nm >0.2, but <0.8) and incubation was continued for 3-5 hours before
sampling for beta-galactosidase analysis.

EXAMPLE 10 
Determination of Hemoglobin Expression 

Fermentations

Hemoglobin-like proteins are expressed in the strains described above using a
standard fermentation protocol. First, a fermentor inoculum is grown from seed
stock. The inoculum is then transferred to a 15 liter fermentor and induced. The
details of the fermentation process are described below.

Seed Stock

Seed stock is grown up in LB broth containing 10 g/L BactoTryptone™, 5 g/L
yeast extract, 5 g/L NaCl, 0.2 g/L NaOH, and 10 µg/ml tetracycline to an optical
density of 1.5 - 1.7 at 600 nm. The solution is then made up to 10% glycerol and
stored at -80°C until required.

Fermentor Tnoculum f.5nn ml broth in 7 T s hak P flasks) 

To prepare the fermentor inoculum, seed stock is thawed and 0.1-0.4 ml of
seed stock is inoculated into 500 ml of a solution containing approximately 4 g/L
KH2PO4, 7 g/L K2HPO4, 2 g/L (NH4)2SO4, 1 g/L Na3-citrate·2H2O, 153 mg/L
MgSO4·7H2O, 2.3 g/L L-proline, 2 g/L yeast extract, 4.8-5 g/L glucose, 75 mg/L
thiamine HCl, 12 mg/L tetracycline, 81 mg/L FeCl3·6H2O, 4 mg/L ZnCl2, 6 mg/L
CoCl2·6H2O, 6 mg/L Na2MoO4·2H2O, 3.1 mg/L CaCl2·2H2O, 3.9 mg/L
Cu(II)SO4·5H2O, 1.5 mg/L H3BO3, and 300 µl/L HCl. This culture is allowed to
grow for 10 hours at 37°C on a shaker. Four flasks are combined and used to
inoculate the 15 L fermentors.

15 L Fermentor (14 L volume in 20 L fermentor = "15 L")

The entire fermentor inoculum is then aseptically transferred to a 20-liter
fermentor containing 10 liters of the following: 1.8 g/L KH2PO4, 3.3 g/L K2HPO4,
1.8 g/L (NH4)2SO4, 155 mg/L thiamine HCl, 10.3 mg/L tetracycline, 3.1 g/L
proline, 1.9 g/L MgSO4·7H2O, 1.9 g/L Na3-citrate·2H2O, 133 mg/L FeCl3·6H2O, 6.4
mg/L ZnCl2, 9.9 mg/L CoCl2·6H2O, 9.9 mg/L Na2MoO4·2H2O, 5 mg/L
CaCl2·2H2O, 6.3 mg/L Cu(II)SO4·5H2O, 2.5 mg/L H3BO3, and 494 µl/L HCl. Note
that masses of added reagents are calculated using the final volume of fermentation,
14 liters, and are approximate within measurement error. The pH is maintained at
6.8 to 6.95 by addition of 15% to 30% NH4OH, dissolved oxygen is maintained at or
above 20%, and 50 to 70% glucose is added throughout the growth period, sufficient
to maintain low but adequate levels of glucose in the culture (0.1 g/L-10 g/L).
Dissolved oxygen is maintained as close to 20% as possible. The culture is grown
between 28 and 32°C for approximately 12 hours prior to transfer to the 600 liter
fermentor, if such transfer is required, or alternatively is induced by IPTG addition
and grown for 10-16 additional hours as described below.
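Because the reagent concentrations are referenced to the 14 liter final volume rather than the 10 liter starting volume, the mass to weigh out is the stated concentration times 14 liters. A short sketch of that arithmetic, using a subset of the concentrations listed above (the dictionary and function names are hypothetical):

```python
FINAL_VOLUME_L = 14.0  # final fermentation volume stated in the text

# Subset of the 15 L fermentor recipe, in g/L of final volume.
recipe_g_per_l = {
    "KH2PO4": 1.8,
    "K2HPO4": 3.3,
    "(NH4)2SO4": 1.8,
    "proline": 3.1,
}


def masses_to_weigh(recipe, final_volume_l):
    """Return grams of each reagent needed for the given final volume."""
    return {name: conc * final_volume_l for name, conc in recipe.items()}


masses = masses_to_weigh(recipe_g_per_l, FINAL_VOLUME_L)
for name, grams in masses.items():
    print(f"{name}: {grams:.1f} g")
```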

600L Fermentations 

The entire seed fermentor inoculum is then aseptically transferred to a
600-liter fermentor containing approximately 375 liters of a solution containing: 1.8
g/L KH2PO4, 3.3 g/L K2HPO4, 1.8 g/L (NH4)2SO4, 3.3 ml/L polypropylene glycol-
2000, 220 g/L glucose, 143 mg/L thiamine HCl, 9.4 mg/L tetracycline, 1.4 g/L
MgSO4·7H2O, 1.4 g/L Na3-citrate·2H2O, 2.9 g/L L-proline, 99 mg/L FeCl3·6H2O, 4.8
mg/L ZnCl2, 7.3 mg/L CoCl2·6H2O, 7.3 mg/L Na2MoO4·2H2O, 3.7 mg/L
CaCl2·2H2O, 4.7 mg/L Cu(II)SO4·5H2O, 1.8 mg/L H3BO3, and 366 µl/L HCl. Note
that the reagent additions are calculated with the final volume of the fermentation,
450 liters, and that all masses are approximate.

Whether at 600 L scale or at 15 L scale, the pH is maintained at 6.8-6.95 by
addition of 15% to 30% NH4OH, dissolved oxygen is maintained at or above 20%,
and 50-70% glucose is added throughout the growth period, sufficient to maintain
low but adequate levels of glucose in the culture (0.1 g/L-10 g/L). The culture is
grown between 24 and 30°C to an OD600 of approximately 10-40 prior to induction
with 10-1000 µM IPTG. Upon induction of hemoglobin synthesis, the E. coli heme
biosynthesis is supplemented by addition of hemin dissolved in 1 N NaOH, either by
addition of the total mass of hemin required at induction, by continuous addition of
hemin throughout the induction period, or by periodic addition of hemin dissolved
in 50 mM to 1 M NaOH (e.g., one third of the total mass of hemin to be added to the
fermentor is added at induction, another third is added after one quarter of the
induction period has elapsed, and the last third is added half-way through the
induction period). Total hemin added ranges from 50 to 300 mg/L. The
fermentation is allowed to continue for 8-16 hours post-induction. At the end of this
period, several 1 ml aliquots are removed from the broth for determination of
hemoglobin production.

The fermentor is run at several temperatures (see further examples below),
controlling dissolved oxygen at 20% and glucose between 0-6 g/L. At an OD of 30 ± 2,
the culture is induced by adding various amounts of 100 mM IPTG to yield several
different concentrations of inducer, as further described below. In 15 L fermentations,
10 mL of 50 mg/mL hemin is added at induction along with the appropriate
volume of IPTG. At 3 hours post induction, 13 mL of 50 mg/mL hemin is added,
and at 6 hours post induction, 17 mL of 50 mg/mL hemin is added. Harvest and
further purification occur at 10-16 hours post induction.
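The dosing arithmetic for a 15 L run can be checked with a short calculation. This is a sketch under one stated assumption: the culture volume is taken as the 14 liter final volume quoted earlier, which the text does not explicitly tie to the moment of induction. Function names are hypothetical.

```python
CULTURE_L = 14.0  # assumed culture volume (final volume of the 15 L fermentation)


def stock_volume_ml(final_um, culture_l, stock_mm):
    """Volume of stock (ml) needed to reach final_um in culture_l litres (C1*V1 = C2*V2)."""
    final_mm = final_um / 1000.0
    return final_mm * culture_l / stock_mm * 1000.0


def hemin_mg_per_l(doses_ml, stock_mg_per_ml, culture_l):
    """Total hemin delivered per litre of culture over all additions."""
    return sum(doses_ml) * stock_mg_per_ml / culture_l


# 100 mM IPTG stock, 15 uM target concentration
print(stock_volume_ml(15, CULTURE_L, 100))          # ml of IPTG stock
# 10, 13 and 17 mL additions of 50 mg/mL hemin at 0, 3 and 6 hours
print(hemin_mg_per_l([10, 13, 17], 50, CULTURE_L))  # mg/L total hemin
```

Under this assumption the three hemin additions total roughly 143 mg/L, which falls inside the 50 to 300 mg/L range stated above.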

Soluble Hemoglobin Measurement by Immobilized Metal Affinity Chromatography

Samples were collected as above and pelleted. Cells were lysed and DNA was 
fragmented as described below: 

1. Cell Lysis

The samples were placed on ice and protected from light. 1.00 ± 0.01 mL of 25 mM
Na2B4O7·10H2O was added to the cell pellet and the sample was vortexed until the
contents were resuspended. 30 ± 1 µL of lysozyme NaCl solution was added to the
resuspended pellet and the sample was re-vortexed. The samples were incubated on
ice for 30-45 minutes while protected from light. Following this incubation, the
samples were mixed at least three times and placed in a 37 ± 1°C water bath for 2.25
± 0.25 minutes. After incubation at 37°C, the samples were again mixed.

2. DNA Fragmentation

30 ± 1 µL of DNase was added to the sample, which was then vortexed. The sample
was then incubated with rocking for 15 minutes at room temperature, while
ensuring that the sample was protected from light.

Hemoglobin was extracted from the samples after cell lysis by first freezing at
-80°C for more than three hours. The samples were then thawed by rocking at
room temperature in the dark. After thawing, samples were mixed and then
centrifuged for 20 minutes at 14,000 rpm in a microfuge. The supernatant was
removed and treated with CO gas for 30 seconds. This CO-treated supernatant was
then heated for 4 minutes at 65°C, if necessary, then spun for 4 minutes in a
microcentrifuge at 14,000 rpm. 0.1 ml of the supernatant was then diluted with 0.9
ml of 5 mM sodium phosphate, 150 mM sodium chloride, pH 7.4, and filtered into
an autosampler vial.

Samples were then analyzed on a Biocad Perfusion Chromatography
Workstation (PerSeptive Biosystems) equipped with a Poros (MC/M) metal chelate
perfusion column, 4.6 mm x 100 mm (PerSeptive Biosystems), charged with zinc.
The column was run at a flow rate of 6 ml/min and the effluent was monitored at
412 nm. The column was loaded with 3 column volumes of 10% 2 M NaCl + 80 mM
Tris-HCl, pH 8.0/90% water, washed with 4 column volumes of 25% NaCl/75%
water, and eluted with 50% 2 M NaCl + 80 mM Tris-HCl, pH 8.0/50% 50 mM EDTA,
pH 8.5.



Soluble Hemoglobin Measurement by Difference Assay 

Five microliters of hemoglobin solution was added to 500 µl of 0.1 M Tris,
pH 8.0. 200 µl of the diluted hemoglobin solution was then added to 2.8 ml of 0.1 M
Tris, pH 8.0, in a 4.5 ml cuvette for a final dilution of 1:1500. The oxygenated sample
(HbO2) was then analyzed by spectrophotometry in a Hewlett-Packard model HP
8452A spectrophotometer. Absorbances at 436, 425, 420, 404, 400 nm were collected
and stored in a data storage system. The cuvette was then removed from the
spectrophotometer and sparged with carbon monoxide two times for 15 seconds
each time. The cuvette was inverted 5 times between each sparge. The sample was
then re-inserted into the spectrophotometer, and a second set of spectra was
collected that corresponded to carbonmonoxy hemoglobin (HbCO). The cuvette was
then again removed from the spectrophotometer and 30 µl of 0.1 M KCN in 0.1 M
Tris, pH 8.0, was added to the sample. The sample was then inverted three times,
allowed to incubate for 5 minutes, and re-inserted into the spectrophotometer for a
final spectrophotometric analysis (HbCN).

The concentration of total hemoglobin was determined using the following
quantities:

\[ \Delta O.D.(rHb) = \left[ A_{420}^{HbCO} - A_{420}^{HbO_2} \right] - \left[ A_{436}^{HbCO} - A_{436}^{HbO_2} \right] \]

\[ \Delta O.D.(rHb^{+}) = \left[ A_{424}^{HbCN} - A_{424}^{HbCO} \right] - \left[ A_{404}^{HbCN} - A_{404}^{HbCO} \right] \]

where A = the absorbance at the subscripted wavelength for the superscripted
hemoglobin species.

Using these values, total reduced hemoglobin (rHb) and oxidized hemoglobin
(rHb+) were calculated as follows:

[rHb (mg/ml)] = ΔO.D.(rHb) / 5.132
[rHb+ (mg/ml)] = ΔO.D.(rHb+) / 7.169
Total hemoglobin (mg/ml) is then simply [rHb] + [rHb+].
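The difference calculation can be expressed compactly in code. The sketch below is illustrative only: it assumes the bracketed terms are read as (HbCO minus HbO2) at 420 and 436 nm for rHb and (HbCN minus HbCO) at 424 and 404 nm for rHb+, it applies no correction for the 1:1500 dilution, and the absorbance values are invented solely to exercise the function.

```python
RHB_DIVISOR = 5.132      # divisor for reduced hemoglobin, from the text
RHB_OX_DIVISOR = 7.169   # divisor for oxidized hemoglobin, from the text


def hemoglobin_mg_per_ml(spectra):
    """Return (rHb, rHb+, total) in mg/ml from absorbances keyed by species and wavelength."""
    hbo2, hbco, hbcn = spectra["HbO2"], spectra["HbCO"], spectra["HbCN"]
    # Assumed reading of the difference terms for the reduced species.
    delta_rhb = (hbco[420] - hbo2[420]) - (hbco[436] - hbo2[436])
    # Assumed reading of the difference terms for the oxidized species.
    delta_ox = (hbcn[424] - hbco[424]) - (hbcn[404] - hbco[404])
    rhb = delta_rhb / RHB_DIVISOR
    rhb_ox = delta_ox / RHB_OX_DIVISOR
    return rhb, rhb_ox, rhb + rhb_ox


# Invented absorbances, for illustration only.
example = {
    "HbO2": {420: 0.30, 436: 0.20},
    "HbCO": {420: 0.50, 436: 0.10, 424: 0.40, 404: 0.20},
    "HbCN": {424: 0.45, 404: 0.18},
}
rhb, rhb_ox, total = hemoglobin_mg_per_ml(example)
print(f"rHb={rhb:.4f} rHb+={rhb_ox:.4f} total={total:.4f} mg/ml")
```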

Insoluble Hemoglobin Measurement by Western Blot 

Cells were lysed as described for the Immobilized Metal Affinity
Chromatography method above. The soluble and insoluble fractions were separated
by centrifuging the lysate for 15 minutes in a microfuge at top speed. The
supernatant (soluble fraction) was transferred to another microfuge tube. The
soluble fractions were assayed for total protein using the BCA (bicinchoninic acid)
assay (Pierce, #23225X, Rockford, Illinois) and diluted to 1 mg/ml with H2O. The
pellet (insoluble fraction) was resuspended in 1 ml of 1% SDS in 25 mM sodium
tetraborate and diluted such that the same volumes of solubilized pellet and water
were used as were used to dilute the corresponding soluble fraction. Each fraction
was diluted 1:1 with 2x SDS sample buffer (125 mM Tris-HCl, 20% glycerol, 2% SDS,
2% beta-mercaptoethanol, 0.01% bromophenol blue). 10 µl of each sample was
applied to a 12% SDS-PAGE gel and run overnight (approximately 14 hours) at 80 V
constant voltage. The gel was transferred onto ProBlott Membrane (Applied
Biosystems #400994) in 10 mM CAPS, pH 11, 10% MeOH for two hours at 400 mA
(Hoefer TE22). The blots were blocked for one hour to overnight in Tris Buffered
Saline plus Tween 20 (TBST: 10 mM Tris pH 8.2, 150 mM NaCl, 0.05% Tween 20)
plus 5% Food Club brand Non Fat Dry Milk (NFDM). The first antibody was an
affinity purified goat anti-rHb antibody. The antibody was diluted 1:2500 in TBST
plus 1% NFDM and rocked with the blot at room temperature for one hour. Prior to
second antibody incubation, the blot was washed three times for five minutes per
wash in TBST. The second "antibody" was horseradish peroxidase (HRP) linked to
Protein G (Calbiochem catalogue #539322, San Diego, California). Stock Protein
G-HRP was diluted 1:5000 with TBST plus 0.5% NFDM. Blots were incubated with
rocking for 1-2 hours. Three final TBST washes of five minutes each were performed.
Blots were developed for one minute with LumiGLO Substrate (Kirkegaard & Perry
Laboratory, #54-61-00), covered with plastic wrap and exposed to X-ray film
(Amersham Hyperfilm-ECL #RPN. 2103).

Developed films were scanned on a pdi (Huntington Station, NY) scanning
densitometer using the white light setting recommended by the manufacturer. The
densitometer is pdi's Model DNA 35 connected to a Sun Microsystems SPARC
Station 2 computer running pdi's Quantity One version 2.4 software. The software
generates an image of the blot which is then analyzed. Image background was
subtracted along a stripe down the center of each lane by a "rolling circle" method.
Each band was quantified as the average density in the band times the band area.
Quantitation values are in units of O.D. x mm^2. Percent soluble recombinant
hemoglobin (rHb) was calculated for each time point by the formula:

\[ \%\ \text{Soluble Hb} = \frac{\text{Soluble Value}}{\text{Soluble Value} + \text{Insoluble Value}} \times 100 \]
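As a brief illustration of the band quantitation and the percent-soluble formula, the sketch below takes hypothetical densitometry readings (average density in O.D. and band area in mm^2) for the soluble and insoluble lanes. The function names and numbers are invented for the example.

```python
def band_value(average_density_od, band_area_mm2):
    """Band quantity in O.D. x mm^2: average density times band area."""
    return average_density_od * band_area_mm2


def percent_soluble(soluble_value, insoluble_value):
    """Percent soluble Hb = soluble / (soluble + insoluble) x 100."""
    return soluble_value / (soluble_value + insoluble_value) * 100.0


# Hypothetical readings for one time point.
soluble = band_value(average_density_od=0.6, band_area_mm2=2.0)    # 1.2 O.D. x mm^2
insoluble = band_value(average_density_od=0.4, band_area_mm2=2.0)  # 0.8 O.D. x mm^2
print(f"{percent_soluble(soluble, insoluble):.1f}% soluble")
```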



EXAMPLE 11

Control of beta-galactosidase production on medium and high copy number
plasmids by chromosomal lacI alleles

Production of beta-galactosidase in strain SGE77 (Stratagene) containing
either plasmid pSGE712 (medium copy number), pSGE714 (medium copy number), or
pSGE721 (high copy number) was compared to strain SGE1495 containing either
plasmid pSGE712, pSGE714, or pSGE721, and to several other strains as well, by
growing in 25 ml flasks or 20 ml test tubes as described above. At induction, the
cultures were divided into two equal portions and IPTG was added, to 100 µM final
concentration, to one half of the culture. The other half was left untreated (control).
Incubation was continued for both under identical conditions. Beta-galactosidase
expression in the IPTG-treated fractions was monitored to ensure that expression
levels were consistently inducible. Beta-galactosidase activity was also determined
from a sample of the cultures that were not treated with IPTG. In the absence of
inducer, beta-galactosidase expression was lowest in strains that contained a lacI
gene on the plasmid or in any of the strains derived from strain SGE1675, which
contained a chromosomal lacIQ1 allele (Table 3). Thus, sufficient Lac repressor was
produced from the chromosomal lacI allele to control expression from Ptac on both
medium and high copy number
plasmids which did not contain lacI.

Table 3.

                                 Beta-galactosidase activity (Miller Units)
Strain/Description   Plasmid     no IPTG      +100 µM IPTG
SGE77/lacIQ          pSGE712      3,069       52,482
SGE77/lacIQ          pSGE714      4,247       32,783
SGE77/lacIQ          pSGE721     24,402       25,141
SGE1495/lacIQ        pSGE712      1,846       36,511
SGE1495/lacIQ        pSGE714      3,957       33,763
SGE1495/lacIQ        pSGE721      9,659       39,293
SGE1661/lacI+        pSGE721     20,881       31,118
SGE1674/lacIQ1       pSGE721      1,203       39,770
SGE1675/lacIQ1       pSGE721      1,778       27,124

lacIQ on the chromosome did not control expression of a beta-LacZ fusion
from the high copy number (~500 copies per cell) plasmid pSGE721. lacIQ on the
chromosome was able to control expression from a medium copy plasmid (~100 copies
per cell; pSGE714) fairly well. A medium copy number plasmid (~100 copies per
cell) which contained its own lacI gene with the native lacI promoter (pSGE712)
was capable of controlling expression well.

lacI+ or lacIQ on the chromosome did not control expression of a beta-LacZ
fusion from the high copy number (~500 copies per cell) plasmid (pSGE721 in SGE77,
SGE1469 and SGE1495 above). Of the strains above, those carrying lacIQ1 on the
chromosome controlled expression from the high copy plasmid best. Note
that SGE1674 and SGE1675 are sibs from the same transduction.
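One way to summarize Table 3 is the induction ratio (induced activity divided by uninduced activity) for each strain and plasmid pair. The sketch below computes it from the tabulated Miller Units. The ratio is an illustrative summary and not a figure reported in the text, and the strain and plasmid labels follow the reading of the table given above.

```python
# (strain, plasmid): (no IPTG, +100 uM IPTG) in Miller Units, from Table 3
table3 = {
    ("SGE77", "pSGE712"): (3069, 52482),
    ("SGE77", "pSGE714"): (4247, 32783),
    ("SGE77", "pSGE721"): (24402, 25141),
    ("SGE1495", "pSGE712"): (1846, 36511),
    ("SGE1495", "pSGE714"): (3957, 33763),
    ("SGE1495", "pSGE721"): (9659, 39293),
    ("SGE1661", "pSGE721"): (20881, 31118),
    ("SGE1674", "pSGE721"): (1203, 39770),
    ("SGE1675", "pSGE721"): (1778, 27124),
}

# Induction ratio: a value near 1 means the promoter was already on without inducer.
fold_induction = {key: induced / basal for key, (basal, induced) in table3.items()}

for (strain, plasmid), ratio in sorted(fold_induction.items(), key=lambda kv: -kv[1]):
    print(f"{strain:8s} {plasmid:8s} {ratio:5.1f}-fold")
```

On these numbers the high copy plasmid pSGE721 is induced about 33-fold in SGE1674 but only about 1.03-fold in SGE77, which matches the conclusion that lacIQ alone does not repress the high copy plasmid.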



EXAMPLE 12

Control of hemoglobin production on medium and high copy number plasmids
by chromosomal lacI alleles

Production of hemoglobin in strain SGE1662 containing plasmid pSGE705 
was compared to production of hemoglobin from SGE1464 containing plasmid 
pSGE720. Cells were grown in 15 L fermentors at 30°C as described above. Samples 
were withdrawn just prior to induction and assayed for hemoglobin levels on a per 
cell and per volume basis. There was no significant difference in hemoglobin 
production prior to induction between SGE1662 and SGE1464. Ten hours after 
induction, samples were again withdrawn and hemoglobin expression was
measured. About twice as much hemoglobin was produced in SGE1464 relative to
SGE1662 (Table 4) even though SGE1464 required approximately three-fold less
IPTG for induction.



Table 4.

Oxy rHb1.1 Yields by Zn Capture from 15 L Fermentations with SGE1464 and
SGE1662.

SGE #     ferm. Oxy (g/L)     IPTG (µM)
1464      1.10 ± 0.29         15
1662      0.58 ± 0.17         55


This demonstrated the superiority of the high copy number plasmid
expression system in which the expression of the heterologous genes, in this case
those encoding recombinant hemoglobin, is controlled by a regulator
overproduced from the chromosome (SGE1464). Induction of higher levels of
soluble globin expression occurred even with lower concentrations of inducer than
required to induce globin protein expression from the medium copy plasmid
containing a lacI gene on the plasmid (SGE1662).

The level of soluble rHb1.1 produced using various low IPTG concentrations
for induction was examined. Cells were grown in 15 L fermentors at 30°C as
described above. Ten hours after induction with IPTG concentrations ranging from
0 to 55 µM, samples were withdrawn and soluble hemoglobin expression was
measured. Significant induction of expression was observed with the lowest IPTG
concentration used, 5.5 µM (Table 5), while essentially no hemoglobin was produced
in the absence of IPTG. The hemoglobin concentration produced increased only
marginally when the IPTG concentration was above 7.5 µM (Table 5). The expression
of rHb1.1, controlled on a high copy plasmid by Lac Repressor produced from the
lacIQ1 allele on the chromosome, was sensitive to induction by low IPTG
concentrations, a significant advantage over other expression systems.

Table 5.

Soluble rHb1.1 from SGE1464 induced by various IPTG concentrations.

µM IPTG     g/L rHb1.1 ± S.D.
0           0.02 ± 0.01
5.5         0.35
7.5         0.8
15          0.73
22.5        0.91 ± 0.28
55          0.9


EXAMPLE 13 

Effect of Temperature on soluble di-alpha hemoglobin expression in SGE1464 

The effect of temperature on the level of soluble expression of di-alpha
hemoglobin (described in co-pending PCT application WO90/13645 to
Hoffman et al.) in strain SGE1464 containing pSGE720 was examined by growing the
cells in 15 L fermentors at various temperatures. Soluble hemoglobin was measured
using the IMAC procedure described above. Soluble hemoglobin expression was
enhanced when the cultures were grown at lower temperatures. Growth at 28°C
significantly improved soluble hemoglobin expression for the high copy number
system (SGE1464; Table 6).

Expression was not enhanced for the strain containing the medium copy
number plasmid which also includes the Lac Repressor gene on the plasmid (Table
6; SGE1662). Because the high copy number plasmid facilitates production of more
rHb1.1 due to the 4- to 5-fold increase in rHb1.1 gene dosage (on the high copy
number plasmid relative to the medium copy plasmid; approximately 500 versus
100 copies per cell), the effect of temperature on solubility of the synthesized protein
is manifest. SGE1464 produces a significant excess of protein which is insoluble and
transparent to the IMAC assay. We identify the insoluble material by a Western Blot
assay (supra) using an antibody specific to rHb1.1. SGE1662 does not produce
insoluble rHb1.1; all the globin synthesized is soluble. SGE1464 produces a
substantial proportion of insoluble globin protein, usually approximately equivalent
to the amount of soluble protein. Therefore, temperature, which is known to
affect the solubility and/or stability of some proteins, can only affect the strain with a
reservoir of insoluble rHb1.1, SGE1464. Overproduction of the protein through
increased gene dosage and improved inducibility created an opportunity for
increased soluble globin expression, which lower temperatures allowed to be
exploited further.

Table 6.

SGE #     ferm. Oxy (g/L)     ferm. temp.
1464      1.10 ± 0.29         30°C
1464      1.50 ± 0.21         28°C
1662      0.58 ± 0.17         30°C
1662      0.51 ± 0.11         28°C


In an additional study, the proportion of soluble rHb1.1 protein produced
from the high copy number system (SGE1464) varied inversely with temperature
(Table 7). At the lowest temperature tested, 24°C, an estimated 70% of the globin
protein synthesized was soluble, while at 30°C only approximately 34% was soluble
(Table 7). We expect this relationship to continue if additional experiments
extend beyond the tested temperature range, but the optimal yield is likely to fall
within the range of temperatures tested, since the highest peak yield per volume of
culture was obtained at 26°C (Table 7), with lower yields at both lower and higher
temperatures.


Table 7.

Temp. (°C)     Ave. % sol.     Ave. peak g/L
24             70              1.10
26             62              1.62
28             53              1.28
30             34              0.83
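
The two trends in Table 7 can be made explicit with a small calculation: a least-squares slope for percent soluble versus temperature, and the temperature giving the highest peak yield. This is an illustrative summary of the tabulated averages only. The straight-line fit is an assumption introduced here, not an analysis described in the text.

```python
# (temperature in C, average % soluble, average peak g/L), from Table 7
table7 = [(24, 70, 1.10), (26, 62, 1.62), (28, 53, 1.28), (30, 34, 0.83)]


def least_squares_slope(xs, ys):
    """Slope of the ordinary least-squares line through the points (xs, ys)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    return numerator / denominator


temps = [row[0] for row in table7]
percent_sol = [row[1] for row in table7]

slope = least_squares_slope(temps, percent_sol)   # percentage points per degree C
best_temp = max(table7, key=lambda row: row[2])[0]

print(f"solubility changes by {slope:.2f} percentage points per degree C")
print(f"highest peak yield at {best_temp} C")
```

The fitted slope is negative (solubility falls as temperature rises), while the peak yield is highest at an intermediate temperature, consistent with the discussion preceding the table.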



EXAMPLE 14

Effect of Temperature on soluble hemoglobin expression in SGE1483

The effect of temperature on the level of soluble expression of wild-
type hemoglobin containing a replacement of the N-terminal valine with methionine
for all the globins (described in co-pending PCT application WO90/13645 to
Hoffman et al.) in strain SGE1483 containing pSGE728 was examined by growing the
cells in 15 L fermentors at various temperatures. The plasmid, pSGE728, is the same
as pSGE720 except that it contains only one alpha gene (i.e., a mono-alpha gene)
instead of the di-alpha gene of pSGE720. SGE1483 is the same as strain SGE1464
except that SGE1483 contains pSGE728 instead of pSGE720.

WO 97/04110                                                    PCT/US96/11600

Soluble hemoglobin was measured using the IMAC procedure described
above. Soluble hemoglobin expression was enhanced when the cultures were grown
at lower temperatures. Growth at 28°C significantly improved soluble hemoglobin
expression for the high copy number system (SGE1483; Table 8).



Table 8.

Temp. (°C)     g/L SGE1483 (rHb0.0)
30             1.60 ± 0.17
28             2.16 ± 0.29



EXAMPLE 15

Table 9.

lacI allele   plasmid    plasmids per cell   LacI per cell   LacI:Plasmid ratio
lacIQ         pSGE705    100                 1100            10:1
lacIQ         pSGE720    500                 100             1:5 (no control)
lacIQ         pSGE715    500                 5100            10:1
lacIQ1        pSGE720    500                 1700            3:1




The proportional relationship (stoichiometry) of the regulatory protein and
the DNA binding site to which it binds helps determine the level of control of gene
expression in the absence of inducing conditions, and the amount of inducer
required for full induction of gene expression from the promoter so controlled. In
the example system described in Table 9, LacI is the regulatory protein and the
plasmid contains one DNA binding site for LacI on each copy in the cell. The
plasmid copy number is estimated as described (supra), and the LacI level is
estimated from relative transcription levels (infra). The ratio of LacI to plasmid must
exceed 1:1 in order to control expression of gene(s) whose transcription is regulated
by LacI on every plasmid in the cell. The only example in Table 9 which does
not produce a ratio greater than 1:1 is the presence of a high copy number plasmid
(pSGE720) in a strain having only lacIQ on the chromosome. This should result in
no control of expression (i.e., the expression is induced even in the absence of IPTG
or other inducer) from at least some plasmids in the cell, since there is not enough
LacI to go around. Two actual examples of this scenario are shown in Table 3, in
which strains containing only lacIQ (SGE77 and SGE1495) failed to repress a high
copy number plasmid, pSGE721, in the absence of inducer.
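The 1:1 criterion can be applied mechanically to the estimates in Table 9. The sketch below divides LacI molecules per cell by plasmids per cell and flags combinations that fall at or below 1:1. The data are the table's estimates. The function name and the boolean flag are illustrative, and the table itself rounds the ratios (for example 1100:100 is listed as 10:1).

```python
# (lacI allele, plasmid, plasmids per cell, LacI per cell), estimates from Table 9
table9 = [
    ("lacIQ", "pSGE705", 100, 1100),
    ("lacIQ", "pSGE720", 500, 100),
    ("lacIQ", "pSGE715", 500, 5100),
    ("lacIQ1", "pSGE720", 500, 1700),
]


def repressor_ratio(laci_per_cell, plasmids_per_cell):
    """LacI molecules available per plasmid-borne binding site."""
    return laci_per_cell / plasmids_per_cell


ratios = [repressor_ratio(laci, copies) for _, _, copies, laci in table9]
# Control on every plasmid requires more than one LacI per binding site.
controlled = [ratio > 1.0 for ratio in ratios]

for (allele, plasmid, _, _), ratio, ok in zip(table9, ratios, controlled):
    status = "control expected" if ok else "no control expected"
    print(f"{allele:7s} {plasmid:8s} ratio {ratio:5.1f}:1  {status}")
```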

Since stoichiometry is important, too much regulatory protein needs to be
avoided if possible. If the strength of a promoter, or the gene dosage for a gene
encoding a regulatory protein, is increased too much, then the abundance of the
regulatory protein may be far greater than the number of sites to which it can bind,
even on a high copy number plasmid. In this case, it may require a large amount of
inducer to activate transcription of the heterologous gene(s), often an undesirable
situation. A large excess of regulatory protein can impair induction at low inducer
concentrations. Inside the cell, enough repressor remains unaffected by inducer to
repress most, if not all, of the promoters on the plasmids controlled by LacI. Only high
concentrations of IPTG can dislodge all the Lac repressor from its cognate binding
sites on the DNA.

An excess of Lac Repressor is produced by plasmids containing a lacI gene,
due to the increased dosage of the gene. In Table 10, the chromosomal expression of
LacZ, also controlled by Lac Repressor, indicates the induction achieved by IPTG
addition in the same way as globin gene expression, but is a much more sensitive
and easily assayed indicator than globin expression. The LacZ expression, as a
percent of that achieved by 1,000 µM IPTG addition for SGE1662 versus SGE1464,
demonstrates the induction of expression by low IPTG concentrations when lower
LacI:plasmid ratios exist. In SGE1464, with an estimated LacI:plasmid ratio of
only 3:1, expression of chromosomal LacZ was induced at much lower
concentrations of IPTG than was necessary to induce equivalent expression from
SGE1662, which has an estimated LacI:plasmid ratio of 10:1. Similar proportions
of rHb1.1 are induced from SGE1464 at IPTG concentrations similar to that required
for chromosomal LacZ induction (see above), indicating that this is an appropriate
and accurate surrogate, since both are controlled by Lac Repressor in the same cell.

Table 10.

Percent Induction of chromosomal LacZ at various IPTG concentrations in the
high copy number (1464) and medium copy number (1662) rHb1.1-producing
strains.

µM IPTG     1662      1464
3           0.2       21.3
10          0.2       44
30          1.6       81.7
100         22.9      88.8
1,000       100       100



EXAMPLE 16 

Characterization of chromosomal lacI alleles

DNA Isolation, PCR, Cloning, and Sequencing. Genomic DNA from strain C600
(E. coli Genetic Stock Center) was isolated from cells lysed by a freeze-thaw
lysozyme method. After SDS and proteinase K treatment, DNA was purified by
phenol extraction and EtOH precipitation. The DNA was spooled from the EtOH
solutions and rinsed. A map-based Polymerase Chain Reaction (PCR) cloning
method was used to amplify the region upstream of the lacI gene. The Kohara map
predicts that there is a PstI site about 600 bp upstream from the lacI promoter. A
12 nucleotide-long oligo with a degenerate 5' half and the recognition site for PstI at
its 3' end, and an oligonucleotide complementary to a region just downstream from
the initiation of translation site in the lacI gene (CBG23): AGT CAA GCT TAA
CGT GGC TGG CCT GGT T, were used in the PCR reaction. CBG23 contained the
recognition site for HindIII. 50 pmole of CBG24 (5'-NNNNNNCTGCAG-3') and 10
pmole of TG45 (5'-CTGGCACCCAGTTGATCG-3') were used per 100 µl reaction.
Pfu polymerase (Stratagene, La Jolla, CA) was used with 200 ng of purified C600
DNA. Reactions were performed in an Ericomp thermal cycler. The first cycle of the
program consisted of 10 min at 95°C, 5 min annealing at 50°C, and 2 min extension
at 72°C. 33 cycles consisting of 1 min at 95°C, 30 sec at 50°C, and 1 min at 72°C
followed. The final cycle was 1 min at 95°C, 30 sec at 50°C, and 10 min at 72°C.
Products were cleaned with the Magic PCR clean-up kit (Promega, Inc., Madison, WI)
and digested overnight with PstI and HindIII. Digested DNA was cleaned again and
ligated into pUC19 plasmid DNA that had been previously digested with PstI and
HindIII. Ligation mixtures were transformed into JM109 (Promega, Inc.). Colonies
containing inserts were identified and the inserts were sequenced using the
Sequenase v.2.0 kit (United States Biochemical). Internal sequencing primers were
used to complete the sequence of the inserts. A PCR primer containing a PstI site
and homologous to the region 200 bp upstream from the lacI promoter was designed
(CBG30): TAT ACT GCA GGG TAT TGG CTG TCT GAA T. Genomic DNA from
RV308 (SGE2606) (Mauer et al. 1980. J. Molec. Biol. 139:147-161 and Meyer et al.
1980. J. Molec. Biol. 139:163-194), SGE1661, SGE1670, and SGE1675 was prepared as
above. CBG23 and CBG30 were used to amplify the lacI promoter region. The PCR
products were cloned as before into the PstI and HindIII sites of pUC19 or pBC SK+
(Stratagene; La Jolla, CA). Insert-containing clones were identified by restriction
digest and inserts were sequenced as before.
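
The cloning sites carried by the two primers can be verified directly from the sequences quoted above. The sketch below searches each primer for the standard PstI (CTGCAG) and HindIII (AAGCTT) recognition sequences. The dictionary and function names are illustrative.

```python
# Standard recognition sequences for the two enzymes used for cloning.
SITES = {"PstI": "CTGCAG", "HindIII": "AAGCTT"}

# Primer sequences as quoted in the text (spaces are only for readability).
PRIMERS = {
    "CBG23": "AGT CAA GCT TAA CGT GGC TGG CCT GGT T",
    "CBG30": "TAT ACT GCA GGG TAT TGG CTG TCT GAA T",
}


def sites_in(primer):
    """Return the names of the recognition sites found in a primer sequence."""
    sequence = primer.replace(" ", "").upper()
    return [name for name, site in SITES.items() if site in sequence]


for name, primer in PRIMERS.items():
    print(name, sites_in(primer))
```

CBG23 carries the HindIII site and CBG30 carries the PstI site, so a product amplified with this pair can be cut with both enzymes and cloned directionally.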

RNA Isolation and Primer-Extension Analysis.

RNA was isolated from bacterial cells by a hot-phenol extraction method.
Cells were grown in shake flasks in M9 minimal media at 37°C. Primers were
radiolabeled with γ-32P-ATP. Unincorporated label was not removed before
annealing primer to RNA. RNA and labeled primer were mixed in annealing buffer
and heated to 65°C for 10 minutes, followed by 50°C for 40 minutes. The mixture
was then placed on ice and incubated for 15 minutes. Superscript II (Stratagene,
Inc.) was used to synthesize cDNA according to the manufacturer's instructions
except for the following additions: 100 µg/ml vanadyl-ribonucleoside complex,
10 mM actinomycin D, and RNasin. Reactions were precipitated with sodium acetate
and ethanol. The precipitate was collected, washed with 70% ethanol, and then dried.
Pellets were resuspended in 3 µl water and 5 µl of sample buffer was added.
Samples were heated at 65°C for two minutes and then loaded on a 6%
polyacrylamide sequencing gel adjacent to sequencing reactions of a plasmid with
wild-type lacI that were performed using the same primer.

Cloning and Sequence of lacI Promoter Regions. lacI promoter region PCR
products from strains SGE1661 and SGE1675 show two and sometimes three bands
for most strains. RV308 has a large deletion of the lac operon region but still has the
upper band, which was therefore believed to be an artefact of the PCR. Sequence
analysis of this band revealed that it had a sequence unrelated to lacI. No sequence
matches were found in database comparisons. The lowest bands are also believed to
be artefacts. Two sizes of intermediate bands were observed. The lower was
estimated to be 10 to 15 basepairs shorter than the upper. All strains which
exhibited good control of expression had the lower band. An unanticipated result
for PCR of SGE1675, which had good control of expression, was the presence of
both bands, indicating that at least a partial duplication of the lacI region may be
present in this strain. PCR products were cloned into pUC19 or pBC SK+ at the
HindIII and PstI sites. Previously published lacI promoter region sequence data
(Farabaugh, 1978, Nature 274:765-769 and Calos, 1978, Nature 274:762-765) extend
only 50 basepairs upstream of the start of transcription. Approximately 600
basepairs of additional sequence was obtained from C600. Smaller regions were
cloned and sequenced from SGE1661, SGE1670, and SGE1675. The sequence of the
lacIQ1 allele is shown in Calos & Miller, 1981, supra. The upper intermediate PCR
band had a sequence similar to the wild-type with two minor, probably strain-
related, differences. The lower band from SGE1670 and SGE1675 had a 15 basepair
deletion. This creates a new -35 region which has the canonical sequence. This
sequence is nearly identical to the published lacIQ1 allele.

Kohara Map-Based Cloning Method for Escherichia coli DNA Regions. 
A method utilizing Polymerase Chain Reaction (PCR) to amplify regions from 
purified DNA is described below. The Kohara E. coli restriction map (Kohara, et al, 
1987) and the known sequence (Calos, 1978, and Farabaugh, 1978) of the lad gene ' 
were used to design two oligonucleotide primers for the PCR. 

Oligonucleotide 2 (CBG24) had the sequence 5'NNNNNNCTGCAG3' and
should prime on any PstI site in the genome. Oligonucleotide 1 (TG45)
corresponded to known lacI sequence near the 5' end of the gene. In the first step,
these two oligos were used to amplify a fragment of about 800 base pairs from E. coli
strain C600 genomic DNA. Oligo 1 determines the specificity in this case. It is
believed important to use a larger amount (50 pmole/100 µl reaction) of oligo 2 in
order to compensate for the redundancy; 10 pmole/100 µl reaction of oligo 1 was
used. This first PCR product was used in a second round of PCR, which provides a
second test of specificity. Oligo 2 plus a second internal lacI oligo, oligo 3 (CBG23),
were used to amplify a subset of this region. Oligo 3 has a HindIII recognition site to
allow cloning of this reaction product. PCR products were digested with PstI and
HindIII and cloned into pUC19 which was also digested with these enzymes. Clones
with inserts were identified and sequencing was begun. Primers complementary to
the vector and to known lacI sequence were used for the first reactions. Sequence
determined from these reactions was used to design more oligonucleotides in order
to complete the sequence.
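
The "redundancy" of oligo 2 can be made concrete with a short sketch. It is
illustrative only: the function names and the test template are hypothetical, and
the sketch assumes that each N position is an equimolar mix of A, C, G and T,
which the text does not state. Under that assumption the six N positions give
4^6 = 4,096 distinct sequences, so any one exact sequence makes up only a small
fraction of the 50 pmole added.

```python
# Illustrative sketch: degeneracy of a primer written with N positions,
# and where such a primer can match a template exactly.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT"}

OLIGO2 = "NNNNNNCTGCAG"  # sequence of oligo 2 (CBG24) as given in the text


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer:
        count *= len(IUPAC[base])
    return count


def match_positions(primer, template):
    """0-based start positions where the primer matches the template strand."""
    size = len(primer)
    hits = []
    for start in range(len(template) - size + 1):
        window = template[start:start + size]
        if all(b in IUPAC[p] for p, b in zip(primer, window)):
            hits.append(start)
    return hits


def pmole_per_species(total_pmole, primer):
    """Amount of each exact sequence, assuming an equimolar mixture."""
    return total_pmole / degeneracy(primer)
```

For example, `degeneracy(OLIGO2)` gives 4096, and `pmole_per_species(50, OLIGO2)`
gives roughly 0.012 pmole of each exact sequence per 100 µl reaction.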

Genomic DNA was then purified from SGE1661, SGE1670, and SGE1675. A
primer that is 200 bp upstream from the lacI promoter was designed. A HindIII site
was added to this oligonucleotide (CBG30). CBG23 and CBG30 were used to
amplify the lacI promoter region from these three strains. PCR products were
analysed by agarose gel electrophoresis, where it was observed that the product from
SGE1670 had a small deletion relative to the product from SGE1661 (wild-type). In
addition, two products, one of each size, were obtained from SGE1675. The PCR
products were digested with PstI and HindIII and cloned into pBC SK+ at the PstI
and HindIII sites. Candidates with inserts were identified and these were
sequenced. It was determined that the smaller PCR product had nearly the same
sequence as the lacIQ1 allele reported in 1981 by Calos and Miller. The larger
product had the wild-type sequence.

References:

Calos, M.P. 1978. DNA sequence for a low-level promoter of the lac repressor and
an "up" promoter mutation. Nature 274: 762-765.

Calos, M.P. and J.H. Miller. 1981. The DNA sequence change resulting from the IQ1
mutation, which greatly increases promoter strength. Mol. Gen. Genet. 183: 559-560.

Farabaugh, P.J. 1978. Sequence of the lacI gene. Nature 274: 765-769.

Kohara, Y., K. Akiyama, and K. Isono. 1987. The physical map of the whole E. coli
chromosome: application of a new strategy for rapid analysis and sorting of a large
genomic library. Cell 50: 495-508.

Measurement of lacI Transcript Level.

lacI transcript levels were assessed in JM109, SGE299, SGE1661, SGE1662,
SGE1670, and SGE1675. The results show that transcription levels in SGE299, SGE1670,
and SGE1675 are similar to each other and are greater than in SGE1661, JM109 and
SGE1662. The lacI transcript initiates at the same point in all strains. Quantitation
of autoradiograms showed that the relative abundance of lacI transcript in
JM109, SGE299, SGE1662, SGE1670, and SGE1675, respectively, was 1:5:7:17:17. JM109
contains lacIQ on an F' factor, which is reported to result in ~100 copies of LacI per
cell (Calos, M.P. ibid). SGE299 contains lacIQ1 on an F factor and, since it has 5-fold
more transcript than JM109, must therefore contain ~500 copies of LacI per cell.
SGE1662 contains a lacI gene on the chromosome and on each of the estimated 100
copies of the expression plasmid, and would be estimated from the literature to
have ~1,010 copies of LacI per cell. Since the transcript was 7-fold greater than
that measured from JM109, we would estimate ~700 copies of LacI per cell, in good
accord with the expected ~1,010 copies of LacI per cell. Both SGE1670 and SGE1675
contain lacIQ1 on the chromosome and, since they have 17-fold more transcript than
JM109, must therefore contain ~1,700 copies of LacI per cell. Relative transcript
level is one way to approximate the amount of regulator protein expected from
mutations that increase promoter strength.
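
The copy-number estimates above follow from scaling the reported value for JM109
by the measured transcript ratios. A minimal sketch of that arithmetic is given
below. The names are hypothetical, and the sketch assumes, as the text does, that
the number of LacI copies per cell is proportional to the relative lacI transcript
level.

```python
# Illustrative sketch: estimate LacI copies per cell from relative lacI
# transcript abundance, scaling from the literature value for JM109.
REFERENCE_STRAIN = "JM109"
REFERENCE_COPIES = 100  # approximate LacI copies per cell reported for JM109

# Relative transcript abundance from the quantitated autoradiograms.
RELATIVE_TRANSCRIPT = {
    "JM109": 1,
    "SGE299": 5,
    "SGE1662": 7,
    "SGE1670": 17,
    "SGE1675": 17,
}


def estimated_copies(strain):
    """Estimated LacI copies per cell, assuming protein scales with transcript."""
    ratio = RELATIVE_TRANSCRIPT[strain] / RELATIVE_TRANSCRIPT[REFERENCE_STRAIN]
    return round(REFERENCE_COPIES * ratio)


if __name__ == "__main__":
    for strain in RELATIVE_TRANSCRIPT:
        print(strain, estimated_copies(strain))
```

This reproduces the estimates of about 500, 700 and 1,700 copies per cell for
SGE299, SGE1662, and SGE1670/SGE1675, respectively.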

EXAMPLE 17 

Effect of Exogenous Addition of Hemin on Soluble Hemoglobin 

Expression in SGE1483 

The effect of addition of higher than normal concentrations of exogenous
hemin on accumulation of the soluble hemoglobin-like molecule rHb0.0 in strain
SGE1483 was examined by growing cells in 15 L fermentations for extended periods
of time post-induction, with two different hemin feeding strategies. Soluble
hemoglobin was measured using the IMAC procedure described above. Soluble
hemoglobin was enhanced when additional hemin supplementation was included
during extended incubation of the culture post-addition of IPTG inducer (Table 11).

Induction of hemoglobin expression in 15 L fermentations is normally
supplemented by hemin additions at the time of induction, and at three and six
hours post-induction. The volumes of hemin solution at each supplementation are
10, 13 and 17 ml respectively, which result in a total hemin addition equal to 0.2 g/L
hemin in the fermentation tank. Additional hemin supplementation was achieved
by a second and third 17 ml addition at 9 and 12 hours post-induction during 16 hour
induction incubations, resulting in a total of 0.37 g/L hemin in the fermentation tank.
The difference in soluble rHb0.0 levels at 10 hours post-induction was not significant
(Table 11); only after 16 hours post-induction did the additional hemin
supplementation manifest a significant improvement in soluble rHb0.0
accumulation (Table 11). Hemin is hydrophobic, and known to precipitate in
aqueous solutions. Therefore, the additional supplementation may help accumulate
higher levels of rHb0.0 by increasing the total concentration, by maintaining a
consistent soluble hemin level throughout the fermentation, or through other
mechanisms. Supplementation strategies for hemin might also include, among the
options, different bolus additions and timing of additions, continuous feeding, very
high initial bolus additions, and alternative chelated hemin for improved solubility.
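
The two hemin totals are mutually consistent, as the following sketch shows. It
is illustrative only: the names are hypothetical, and it assumes that every ml of
hemin solution contributes the same amount of hemin per liter of culture (that is,
a constant stock concentration and an effectively constant tank volume), which
the text does not state.

```python
import math

# Volumes (ml) of hemin solution per addition, as given in the text.
STANDARD_ML = [10, 13, 17]            # at 0, 3 and 6 hours post-induction
EXTENDED_ML = STANDARD_ML + [17, 17]  # plus 9 and 12 hours post-induction

# Assumption: the stated 0.2 g/L total for the standard schedule fixes the
# g/L of hemin contributed by each ml of hemin solution.
G_PER_L_PER_ML = 0.2 / sum(STANDARD_ML)


def total_hemin_g_per_l(volumes_ml):
    """Total hemin added (g/L of culture) for a list of addition volumes."""
    return sum(volumes_ml) * G_PER_L_PER_ML


if __name__ == "__main__":
    print(round(total_hemin_g_per_l(STANDARD_ML), 2))  # standard schedule
    print(round(total_hemin_g_per_l(EXTENDED_ML), 2))  # extended schedule
    assert math.isclose(total_hemin_g_per_l(EXTENDED_ML), 0.37)
```

The standard schedule totals 40 ml and the extended schedule 74 ml, so the
extended total is 0.2 x 74/40 = 0.37 g/L, matching the stated value.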






Table 11

                        g/L soluble rHb0.0 at
Total g/L hemin added   hours post-induction
                        10           16
0.20                    2.00±0.10    1.52±0.02
0.37                    2.23±0.32    2.58±0.27

EXAMPLE 18

Effect of Exogenous Addition of Hemin on Soluble Hemoglobin Expression in 

SGE1464 

The effect of addition of higher than normal concentrations of exogenous
hemin on accumulation of the soluble hemoglobin-like protein rHb1.1 in strain SGE1464
was examined by growing cells in 15 L fermentations for extended periods of time
post-induction, with two different hemin feeding strategies. Soluble hemoglobin
was measured using the IMAC procedure described above. Soluble hemoglobin
was enhanced when additional hemin supplementation was included during
extended incubation of the culture post-addition of IPTG inducer (Table 12).

Induction of hemoglobin expression in 15 L fermentations is normally
supplemented by hemin additions at the time of induction, and at three and six
hours post-induction. The volumes of hemin solution at each supplementation are
10, 13 and 17 ml respectively, which result in a total hemin addition equal to 0.2 g/L
hemin in the fermentation tank. Additional hemin supplementation was achieved
by a second and third 17 ml addition at 9 and 12 hours post-induction during 16 hour
induction incubations, resulting in a total of 0.37 g/L hemin in the fermentation tank.
The difference in soluble rHb1.1 levels at 10 hours post-induction was not significant
(Table 12); only after 16 hours post-induction did the additional hemin
supplementation manifest a significant improvement in soluble rHb1.1
accumulation (Table 12), as seen for rHb0.0 above.






Table 12

                        g/L soluble rHb1.1 at
Total g/L hemin added   hours post-induction
                        10           16
0.20                    1.14±0.21    1.53±0.28
0.37                    1.28±0.23    1.95±0.26



documents referred to herein are hereby incorporated by reference. 






What is Claimed is: 



1. A prokaryotic host cell for producing a heterologous polypeptide comprising:

a medium or high copy number plasmid comprising a regulatable
expression unit encoding said polypeptide; and

a chromosomally-located gene encoding a regulatory protein capable
of regulating said regulatable expression unit, expression of said
regulatory protein being controlled by a strong promoter.

2. A prokaryotic host cell according to claim 1, wherein said plasmid is a high
copy number plasmid.

3. A prokaryotic host cell according to claim 1, wherein said expression is
repressive, said regulatory protein is a repressor capable of repressing said
repressible expression unit, and expression of said repressor is controlled by a
strong repressor promoter.

4. A host cell according to claim 1, selected from the group consisting of
Escherichia, Salmonella, Bacillus, Clostridium, Streptomyces, Staphylococcus,
Neisseria, Lactobacillus, Shigella, and Mycoplasma.

5. A host cell according to claim 4, wherein said host cell is E. coli.

6. A host cell according to claim 1, wherein said regulatable expression unit is
selected from the group consisting of lac, tac, trc, trp, ara, fru, gal and mal.

7. A host cell according to claim 6, wherein said regulatable expression unit is
lac.

8. A host cell according to claim 5, wherein said expression unit is lac.

9. A host cell according to claim 3, wherein said repressor is selected from the
group consisting of repressors of lac, tac, trc, trp, ara, fru, gal and mal.

10. A host cell according to claim 9, wherein said cell is E. coli and said repressor
is LacI.

11. A host cell according to claim 10, wherein said strong repressor promoter is
lacIQ1.

12. A host cell according to claim 11, wherein the ratio of lacIQ1 to said plasmid
is about 3:1.

13. A host cell according to claim 10, wherein said high copy number plasmid is
selected from the group consisting of pSGE720, pSGE726 and pSGE728.

14. A host cell according to claim 3, wherein said host cell is E. coli, said
repressible expression unit is lac-based, said repressor is LacI, and said strong
repressor promoter is lacIQ1.

15. A host cell according to claim 14, wherein said heterologous polypeptide is
recombinant hemoglobin.

16. A host cell according to claim 15, wherein said recombinant hemoglobin is
rHb1.1.

17. A method of producing a heterologous polypeptide comprising culturing, in
the presence of an inducer, a bacterial host cell comprising:

a medium or high copy number plasmid comprising a regulatable
expression unit encoding said polypeptide; and

a chromosomally-located gene encoding a regulatory protein capable
of regulating said regulatable expression unit, expression of said
regulatory protein being controlled by a strong promoter.

18. A method according to claim 17, wherein said plasmid is a high copy number
plasmid.

19. A method according to claim 17, wherein said host cell is selected from the
group consisting of Escherichia, Salmonella, Bacillus, Clostridium, Streptomyces,
Staphylococcus, Neisseria, Lactobacillus, Shigella, and Mycoplasma.

20. A method according to claim 19, wherein said host cell is E. coli.

21. A method according to claim 20, wherein said regulatable expression unit is
lac-based, said regulatory protein is LacI, and said strong promoter is lacIQ1.

22. A method according to claim 17, wherein said regulatory protein is a
repressor selected from the group consisting of repressors of lac, tac, trc, trp,
ara, fru, gal and mal.

23. A method according to claim 17, wherein said regulatable expression unit is
selected from the group consisting of lac, tac, trc, trp, ara, fru, gal and mal.

24. A method according to claim 17, wherein said heterologous polypeptide is
recombinant hemoglobin.

25. A method according to claim 24, wherein said recombinant hemoglobin is
rHb1.1.

26. A method according to claim 17, wherein said expression is repressible, said
regulatory protein is a repressor capable of repressing said repressible
expression unit, and expression of said repressor is controlled by a strong
repressor promoter.

27. A method according to claim 17, wherein said host is cultured at a
temperature of about 20°C to about 30°C.

28. A method according to claim 27, wherein said temperature is about 24°C to
about 28°C.

29. A method according to claim 28, wherein said temperature is about 26°C.

30. A high copy number plasmid selected from the group consisting of pSGE720,
pSGE726 and pSGE728.

31. A plasmid according to claim 30, wherein said plasmid is pSGE720.